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STATISTICS AND THE PUBLIC INTEREST* 


ArRYNEss JoY WICKENS 
Bureau of Labor Statistics 


N the past few years, statistics in the United States have come to 

be used as determinants of private and public actions affecting the 
fortunes of millions of our fellow citizens. As such, they have captured 
the interest and have become the concern of many who were previously 
unaware of the existence of statistics. 

This development has come so suddenly—-and spread into so many 
fields of activity—that the statistical profession as a whole seems al- 
most unaware of what has been happening. As a profession, we are 
scarcely prepared, and certainly not organized, to meet the serious 
responsibilities placed upon us by these new uses of statistics. 

It is to these new responsibilities, and the challenge which they 
present, that I wish to direct your attention this evening. I propose, 
first, to review the new ways in which statistics are now used; then to 
discuss the high standards of statistical competence and integrity 
which these new uses require us to maintain; the new phases of sta- 
tistical training which they suggest; and the public responsibilities 
which they impose upon statistical agencies and statisticians. 

I need scarcely remind this audience that statistics have come to be 
one of the great descriptive and analytical tools of modern industrial 
society, comparable to the other new tools of science. The varied pro- 
gram of this Association at this Annual Meeting is evidence of the 
fact that statistics are applied today not merely to describe the be- 
havior of man as a social and economic animal, but also to describe 
plant and animal life, the wlather, and, indeed, the shape of the uni- 
verse. 

Statistics of a sort can, of course, be traced back to ancient times, 
but they have flowered since the industrial revolution. Beginning in 


* Presicential Address at the 112th Annual Meeting of the American Statistical Association, Chi- 
cago, December 28, 1952. 
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the 19th century, statistical records were developed to describe the 
society of that era, and to throw light on its economic and social 
problems. No doubt they influenced the course of men’s thinking then, 
and even, in some instances, may have led to new policies and new 
laws; but primarily their uses were descriptive. 

Increasingly, in the 20th century, and especially since World War I, 
statistics have been used to settle problems, and to determine courses 
of action. In private enterprise, quality control tests now change the 
production lines of industrial enterprises. New products are developed 
and tested by statistical means. Scientific experiments turn upon sta- 
tistics. The management of industry today employs statistics in many 
ways as a guide to internal operations. 

Their use in the sphere of public economic policy-making has ex- 
panded almost unbelievably. Beginning with the 1920’s, we have seen 
- the development of a whole battery of economie statistics, collected in 
great detail, by industry, area, and product, and summarized in the 
familiar indexes and aggregates designed to measure changes in the 
economic cycle, e.g., the gross national product; the indexes of indus- 
trial production; employment and unemployment; financial statistics; 
price data of many kinds; wages; the volume of trade and commerce— 
and so on through a long catalogue. As time has gone on, these reports 
have been compiled in more and more detail, with greater frequency 
and improved quality. 

These statistics come from many sources—from individual business 
firms and industries and from the professions and their associations; 
from labor organizations; from schools and colleges; and from private 
research organizations of many kinds. Because they are often nation- 
wide and necessarily costly, they have been compiled to an increasing 
extent in the past two decades by the Federal Government. Some of 
these statistical producing agencies are old, tried, and experienced, and 
their standards of performance are well known; others are virtually 
unknown and untested. No license is required to produce statistics. 
These agencies produce statistics for widely differing purposes and, con- 
sequently, some of these summary statistics are finer, more precise 
instruments than others. But for the most part, these summary eco- 
nomic indicators were intended primarily as analytical tools, for quite 
broad economic analysis. | 

For example, they have long been used by the Congress and by 
executive agencies of the government in making laws and determining 
policies, as well as by private firms and individuals in making their 
decisions. As a rule, they have been only one of many factors con- 
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sidered in reaching a decision, and since they were designed as ap- 
proximations, they are suitable for just such general purposes as these, 
where a fairly wide tolerance is acceptable. 

World War II, however, made sudden new demands upon statistics 
for purposes of regulation and control. (There had been some other 
instances of this kind previously, but they had not been so important., 
All at once, detailed economic statistics were in demand for determina- 
tion of allocations of materials or of shipping space, or for price or wage 
regulation, or for other control purposes. Those statistics at hand were 
seized upon, whether or not they were really suitable for control pur- 
poses. If no statistics were available, a special-purpose survey was 
usually hurriedly devised, often with samplirg errors of undetermined 
magnitude. Here, a mistake in classification, in tabulation, or, most 
important, in the framing of the questions or in the interpretation of 
the data, had immediate repercussions upon business firms, their em- 
ployees, and the defense program itself. The statistical profession, it is 
clear, was quite unprepared for the war emergency. Yet those special 
wartime statistics were discontinued, with considerable ent) usiasm, 
as soon as controls ended, only to be partially and hastily reassemble ’ 
for the Defense Program in 1950. Clearly, economis?s and statisticians 
have still not provided the public with a design for an adequate, 
integrated economic intelligence system, based upon sound statistical 
standards for use in such emergencies. We have not even provided a 
first blueprint of such a plan. 

In still another field, statistics have been used more and more as 
evidence admissible in court. Both detailed studies relating to particu- 
lar firms or industries and broad economic indicators appear in at- 
torneys’ briefs and exhibits. Statistics are also used by the Federal 
Trade Commission and the Department of Justice in deciding whether 
to prosecute a case. The so-called concentration ratios employed in 
anti-trust prosecutions are an example. 

Still more recently, statistics have come to be used as automatic 
regulators or governors, in the engineer’s use of that term. They have 
been written into law as “trigger figures,” determining automatically 
whether certain actions take place. They have been written into con- 
tracts between private firms and individuals, and between governments 
and private individuals. They have thus come to determine what prices 
millions of farmers get for their crops, or other millions of workers get 
for their day’s work, or the amount of taxes a business enterprise must 
pay, or what actual dollar price a producing company receives for a 
new generator. 
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Consider for a moment some of these statistical governors. We are 
familiar with them, but not necessarily with their new uses. The oldest 
in the United States is the Census, under the Constitutional provision 
whereby the number of representatives for each State in the House of 
Representatives shall be determined by the Congress on the basis of a 
Census of the Population taken: once every 10 years. Thus, since the 
Census of 1950, more representatives come from California and fewer 
from Pennsylvania. These Census calculations are generally accepted, 
since this is their primary purpose. 

Then, in point of time, there is the parity index, another specially 
constructed statistical governor. It was devised by Congress after long 
and bitter agricultural depression, as a yardstick against which to 
measure farm prices or the farmer’s fair share of the national product 
in comparison with the industry from which he buys and to which he 
sells. This yardstick—so complex that few economists and too few 
farmers understand it—is a ratio of indexes of prices paid by farmers 
to prices received by farmers, hooked back to a pre-World War I base 
of 1910-14. Today, price supports covering some 40 percent of the sales 
value of farm commodities are based upon the parity index and parity 
prices. Congress, having fixed the base period of 1910-14 by law, later 
amended the Act to add farm wages to the index of prices farmers pay, 
and to more or less modernize the price relationships as between the 
individual farm commodities. But for the most part, the parity index 
continues, with occasional quiet revisions by its custodians in the Bu- 
reau of Agricultural Economics, as one of the main factors governing 
the receipts of millions of farmers. 

Then there are several statistical measures of which the Bureau of 
Labor Statistics is custodian, which have similar uses as “triggers” or 
governors, only one of which was especially compiled for such purposes. 

To take the most recent example first: The Bureau’s statistics on 
housing starts have just been used, under an Act passed by the Con- 
gress in the summer of 1952, as a determinant of the time at which the 
Federal Reserve limitations on housing credit under the so-called 
“Regulation X” were suspended. These statistics were originally de- 
signed as a descriptive measure of the approximate number of new 
dwellings started in non-farm areas. They were then not calculated on 
the basis of a seasonally adjusted annual rate, as called for in the Act 
passed by the Congress, and so a difficult and necessarily rough sea- 
sonal calculation had to be made. Moreover, the preliminary form in 
which they are compiled has sizable potential errors for three months, 
until final returns come in. Yet the Act provided that they were to be 
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used as “trigger figures” to suspend these particular credit regulations. 
It is fortunate that the level of home-building activity in the summer 
and early autumn of 1952, when this Act became effective, fell so far 
short of the seasonally adjusted annual rate of housing starts specified 
in the Act that the decision to suspend Regulation X could be made 
with statistical confidence. It could easily have been otherwise. 

A similar “trigger” use of the Consumers’ Price Index was considered 
when the Defense Production Act of 1950 was being debated in the 
Congress. In fact, this particular provision once passed the House of 
Representatives. It provided that price and wage controls during the 
defense emergency were to go into effect when the Consumers’ Price 
Index had advanced by 5 percent from a given date. However, it was 
pointed out to members of the Congress that the Consumers’ Price 
Index, being made up of retail prices, would move late and slowly, 
and that prices of raw materials and industrial products might have 
sky-rocketed before the Consumers’ Price Index had moved by the sug- 
gested amount. By then it would have been too late for effective 
price control measures. Consequently, this particular proposal was 
dropped. 

The venerable Wholesale Price Index of the Bureau of Labor Sta- 
tistics has also been extensively used as an escalator in business con- 
tracts during and since World War II. When this Index was revised 
early in 1952, it was found that thousands of private contracts involv- 
ing billions of dollars contained escalator clauses by which the amount 
to be paid by the purchaser varied automatically with the wholesale 
index or some component of it. These contracts include government 
contracts for such equipment as ships, which require several years to 
construct; many private contracts for heavy equipment or mainte- 
nance, such as generators or elevator maintenance; long-term leases on 
commercial and industrial properties; public utility rates in one State; 
and a variety of other contracts. 

In certain of these contracts in which the materials component varies 
with wholesale prices of materials, the wage component escalates on 
average hourly earnings, as compiled by the Bureau of Labor Statistics 
in connection with its nation-wide monthly survey of manufacturing 


_employment, hours and earnings. This compilation of earnings, while 


it is very comprehensive, is not intended to be accurate to the last 


penny. 

Another statistical series in the Bureau of Labor Statistics used for 
escalation purposes is the LIFO (Last In First Out) price index, espe- 
cially compiled at private expense for the use of department stores in 
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calculating allowable changes in inventory values for tax purposes, 
under an agreement with the Bureau of Internal Revenue. 

Then there is the well-known use of the Consumers’ Price Index in 
wage contracts, under which wages change immediately and automati- 
cally at regular intervals, usually quarterly, whenever there is a speci- 
fied change in the index. There are also many instances of agreements 
in which wages or other income payments are subject to renegotiation 
or reconsideration upon a specified change in the index. In addition to 
wage contracts, the Consumers’ Price Index is used in many other ways 
as an escalator—for example, in leases for residential buildings, for pay- 
ments for family maintenance, such as relief payments and alimony 
payments. These are private contracts negotiated by industry and 
labor and by private individuals, usually without consultation with 
the agency which issues the figures. Some of them are fairly long-term, 
like lea « s for residential buildings, or in the wage field, like the General 
Motors five-year contract. Others are for only a year or two. 

In addition to these governmental statistics, there are a number of 
privately compiled yardsticks which serve similar purposes. 

Review, now, the effect that these statistics have. A State gains or 
loses a Congressman; farmers do or do not get loans on their crops; 
an industrial producer does or does not get a higher price for his prod- 
ucts; a tax bill diminishes or increases; wages go up or down. It is no 
exaggeration to talk of hundreds of millions or even billions of dollars 
being involved. Let me be specific: Take the case of wage contracts 
tied to the Consumers’ Price Index. 

In most of these wage contracts, a change of 1 point or 1.14 points 
on the present index, which in 1952 has varied around 190 percent of 
the 1935-39 average, means a change of one cent an hour in wage rates. 
An increase of one cent an hour for a 40-hour week for 50 weeks out of 
the year for 3,500,000 workers known, at a minimum, to be under 
such contracts, means $70,000,000. This does not count millions of 
other employees whose wages or salaries may follow suit along with the 
index. Testimony before a Subcommittee of the House Committee on 
Education and Labor, under the chairmanship of Congressman Steed, 
which held hearings on the Consumers’ Price Index in 1951, indicated 
that one cent an hour totaled $8,000,000 a year for the General Motors 
Corporation alone, and $30,000,000 for the railroads, which have over 
1,000,000 employees under such contracts.! The tables in these wage 
contracts are set up in brackets of 1 or 1.14 points, by tenths of a point 





1 House Committee on Education and Labor, Subcommittee to Study Consumers’ Price Index, 82d 
Congress, Ist Session, Subcommittee Report No. 2, 1951. 
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on the index. Thus, at the margins of those wage brackets, if the index 
rises or falls by one-tenth, hundreds of millions of dollars are immedi- 
ately involved. 

Uses of statistics such as these imply a faith in statistics of which 
statisticians can be very proud. If statistics and statistical agencies in 
the United States were not trusted or had not earned a right to public 
confidence, all this would never have happened. We accept the acco- 
lade, but like the Lord Chancellor in Jolanthe we say, a little wryly, 

“But though the compliment implied 
Inflates me with legitimate pride, 


It nevertheless can’t be denied 
That it has its inconvenient side.”’ 


Such responsibility as this is very sobering. 

These statistics were usually created, as I have said, for descriptive 
and analytical purposes, and machinery for their production was geared 
to more or less leisurely uses by people who were assumed to be reason- 
ably familiar with their construction and their technical limitations. 
The checking devices, the revision programs, the publicity plans, were 
not laid out with a view to such awesome uses as these, by a wide, often 
unknown, and statistically unsophisticated public. The producing 
agencies have had to change their practices; to speed up their calcula- 
tions; to guard their results from “leaks”; and greatly to expand their 
public consulting machinery. As never before, they are operating in a 
goldfish bowl. They are being advised by numerous committees ap- 
pointed from the ranks of industry, labor, and technical groups. These 
advisory committees convey the statistical needs of the groups from 
which they come to the governmental agencies, and often provide tech- 
nical advice as well. This type of advisory committee should be con- 
tinued and expanded if government statistics are to continue to serve 
broad public uses. 

I can say from personal experience that, while these governmental 
agencies did not seek the exceptional responsibility which the new 
uses of our statistics have introduced, they have taken the responsibil- 
ity very seriously. In the opinion of impartial observers, such as the 
Task Force headed by Professor Frederick C. Mills for the Hoover 
Commission, the important statistical series produced in government 
are now administered with competence and honesty and a good deal 
of skill. That is not to say that they could not be improved, for they 
could be. But the statisticians who make these statistics know better 
than anyone that they were never intended to be accurate to the last 
decimal, as many of their users believe them to be. 
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By this I do not mean that these statistics are not correctly calcu- 
lated arithmetically. As I have just indicated, I believe they are. Arith- 
metic is easy to check, and statisticians as a rule do arithmetic well. 
Most of these major economic indicators are either double-calculated 
by two different sets of clerks, or they have built-in mechanical checks 
of various kinds. Of course, there are bound to be occasional errors. No 
system is infallible. But those errors are accidental. So don’t worry 
about that last decimal! The figures will usually be arithmetically cor- 
rect, according to the ground rules laid down for the calculation in 
question. You cannot “rig” a summary figure with 150,000 prices, or 
millions of people, in it, calculated by a whole battery of individuals, 
even if s»meone wanted to—and I do not believe they do. 

Rather, I am referring to accuracy in its statistical sense. This de- 
pends upon the ground rules. The question is: Was this statistical 
series designed, like the standard meter or the inch, to be accurate to 
a very small fraction of a point? For most of these general-purpose 
national economic indicators, the answer, of course, is “No.” I repeat— 
no such accuracy was needed for the purposes for which these statistics 
were designed. It is not built into the calculations. But, given careful 
observance of the ground rules, a summary index is usually as correct 
one time as another, and hence will measure changes from time to time 
with equal accuracy, except in most unusual circumstances. It must be 
accepted as it stands, and the public must know that. 

The next question, then, is: Did the users choose the proper statis- 
tics? Are these indicators really suitable for some of their new uses? 
Obviously, that question can be answered only with reference to speci- 
fic cases. But take the major ones: Of the indicators I have named, 
only the Census, the parity indexes, the LIFO index, and some of the 
privately-constructed indexes were designed for the specific purposes 
which they now serve—and none of them is accurate to the last digit, 
nor is that kind of accuracy required, in my opinion. 

However, in many cases, I believe that the other, more general, in- 
dicators have also served their new uses well. Most of the statistics 
which are currently used as automatic “governors” are price indexes, 
except for the residential construction statistics. They have, in effect, 
provided insurance to the parties to contracts against wide swings in 
the price level. Without discussing the more fundamental economic 
questions involved in these escalator clauses, may I merely say that 
there has rarely been a time in our recent history when insurance 
against price changes was so appropriate, or when the price swings were 
so wide that an approximate indicator was good enough to deal out 
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rough justice to the parties concerned, provided they chose the ap- 
propriate statistical series in the first instance. And for many of the 
large contracts this has been true. 

Now, given these new, more weighty, and more widespread uses of 
statistics, what do we, as statisticians, owe the trusting public which 
accepts our products on faith, often with a fateful sentence which be- 
gins, “Statistics prove that...” 

We owe the public, I believe, these things: Arithmetic accuracy ; con- 
tinuing observance of the ground rules for compilation, or a warning 
if they are to be changeu; fairness and impartiality in handling the 
results; a clear description of the general nature and limitations of sta- 
tistics, with a simple measure of their accuracy, if it can be measured, 
and a warning if it cannot. We cannot, and we should not, try to ex- 
plain every technical detail, or to make statisticians of everyone, any 
more than all of us should try to understand all the mechanical details 
of how our automobiles work. 

We owe the public also, if we are asked, advice as to whether a cer- 
tain statistical measure is suitable for a given use or not, and an educa- 
tion in the fact that statistics must and will change. They cannot be 
expected to be compiled in exactly the same way indefinitely. They get 
out of repair and go out of fashion, just like other products, if they are 
not revised from time to time, and so responsible statistical agencies do 
revise them. What would you think of a city consumers’ price index 
which today had the lamp chimneys and high-button shoes of 1900? 
You may be sure that some of the articles in the indexes of today will 
look just as peculiar in 25 years. Therefore, anyone making a contract 
or setting up any other long-term use for a statistical series should be 
told that he should make provision for periodic shift-overs to revised 
statistics, if he does not wish to upset the contract. In my opinion, it 
would be wise also to provide for periodic review of whether circum- 
stances have so changed that other measures—or no statistical meas- 
ures at all—would be preferable. For example, it should be remembered 
by those who employ price indexes in contracts that in periods of price 
stability, which have not been uncommon in our history, such escala-. 
tors as these indexes provide may either not be needed, or since they 
are only approximate measures of trends, they may not be accurate to 
a sufficiently fine point so that their changes have any real significance. 

The public should know, too, that there is no real assurance that 
the statistics they use today will be issued at all 10, 15, or 50 years 
from now. Although they are authorized by law, there is nothing ex- 
cept custom and the enthusiasm of their sponsors to keep them going, 
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except, as in the case of the Census, or the parity index, where they 
have a statutory directive. Other statistics may be discontinued be- 
cause they have served their primary purpose—although this is un- 
likely to be true of the major economic indexes—or they may be 
changed or eliminated because of lack of basic data or of financial or 
organization support. In general, in government, as in private agencies, 
budgets are reviewed each year and they are often changed, with con- 
siderable danger to the continuity of statistical service. As the Steed 
Subcommittee of the House Committee on Education and Labor ob- 
served in its report on the Consumers’ Price Index in 19512 
“The subcommittee believes that the Consumers’ Price Index has become 
so important that it must be regarded as a fixed charge spon the Govern- 
ment; it should not be subject to yearly fluctuations in budget and at the 
same time be required to do the same amount of work.... Unlike some 
Government programs, the issuance of a statistic of this type depends al- 
most completely upon continuity of effort. It is impossible to cut the work 
one year and increase it the next on the same project. ... ” 


Therefore, the users of statistics and the profession must be educated 
to take some responsibility for continuity of statistics where that is 
important—for today it is not assured. 

Finally, statisticians owe the public, I believe, absolute assurance of 
continued competence, of honesty and fairness in the calculation of 
these statistics—and indeed of all major statistics, whether they are 
“trigger figures” or not. Note, I say “assurance.” I mean that we, as 
statisticians, need not merely to be competent, fair, and honest, but we 
need to be able to prove to a statistically unsophisticated public that, 
in fact, our statistics are trustworthy. 

This is not merely a public relations problem, although its public 
relations aspects are very important, but one of fundamental, long- 
run institutional changes which will assure that statistics continue to 
serve in the public interest. I believe that the statistical profession and 
its sister professions in the social and the natural sciences, where sta- 
tistics are used, should immediately take the initiative and assume 
responsibility for this public assurance. 

Consider what we do not have in our profession, which other profes- 
sions have. There is in our profession, for example, nothing comparable 
to the Certified Public Accountant, or “Member of the College of 
Surgeons.” There is no such label as a “Certified Public Statistician.” 
There is no certification of the source of statistics like the stamp “U.S. 





2 House Committee oa Education and Labor, Subcommittee to Study Consumers’ Price Index, 82d 
Congress, 1st Session, Subcommittee Report No. 2, 1951, p. 39. 





ey 
De= 
In- 
be 


eS, 
n-~ 
ed 
b- 


ne 
ae 
he 
ne 
i]- 
rk 


oe 





STATISTICS AND THE PUBLIC INTEREST ll 


inspected—choice, good or commercial,” and few statistical brand 
names that mean anything except to the statistically sophisticated. 
Moreover, for our products, there cannot be standardization, and there 
is only voluntary labeling of the contents, in footnotes which are often 
too technical for the public. Thee is no audit comparable to the audit 
of a firm of certified public accountants, invited by a corporation to 
audit its books, in order to assure the stockholders and the public not 
merely of the accuracy of the arithmetic of its accounting, but also 
of the validity of the way in which the books are kept. 

The time has come, in my opinion, for the statistical profession, in 
its own interest, to consider devising some means by which these func- 
tions can be performed for important statistical series. Such a sugges- 
tion implies no doubt of the present validity of existing statistics, nor 
should it be considered to reflect in any way upon the professional 
standing or integrity of the statisticians concerned. An accountant 
does not feel incompetent because he is not a CPA, but for certain work 
a CPA is required; a company is proud of a “U.S. inspected” label; an 
accountant does not feel that it reflects on his integrity because an out- 
side accounting firm audits his books. In fact, he welcomes the verifica- 
tion of his procedures and his accuracy. And so should statisticians. 

What are those guarantees? What underlies them? 

The fundamental guarantee of integrity of any statistics, now or in 
the future, in public or private agencies, as the Mills Committee® ob- 
served, lies in the quality and the competence of the people who com- 
pile them. While there is no statistical equivalent of admission to the 
bar, over the past three decades we all know that there has been a great 
improvement in the quality of statistical work; in basic statistical train- 
ing in the colleges; and in standards set for statisticians in public and 
private agencies. There are now advanced degrees in Statistics, with 
fine training in methodology. In the public service, this Association has 
helped the Civil Service Commission to define standards for various 
grades of work. What is now most lacking in statistical training in col- 
leges, I believe, is an adequate appreciation of the fact that statistics 
are tools to be applied to subject matter. A highly trained mathemati- 
cal statistician without knowledge of the subject can be quite as dan- 
gerous as a subject-matter specialist trying to use statistical methods of 
which he has little knowledge. Statistical students should therefore be 
given a well-rounded training in both areas. Advanced statistical train- 





* The Statistical Agencies of the Federal Government, A Report to the Commission on Organiza- 
tion of the Executive Branch of the Government, by Frederick C. Mills and Clarence D. Long, of the 
Research Staff of the National Bureau of Economic Research, 1949. 
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ing should certainly involve practical field work experience wherever 
that is at all feasible. In view of the fact that statistics today are big 
business, students should be taught how the important, large-scale 
statistics are made, and how the individual statistician’s role is often 
to serve as one cog in a big, intricate, mass-production process. Teach- 
ers will also do well to give emphasis to an uncodified, but none-the- 
less real, standard of ethics, which demands of the statistician accuracy, 
impartiality, and integrity in all of his work. 

Beyond good training, comes experience in formulating a statistical 
project, defining the terms, managing a study. The crucial nature of 
this area of statistical administration is too little recognized. These new 
uses of statistics make it all important. 

With these considerations in mind, I hope that this Association will 
take positive action to carry out some of the excellent suggestions on 
training made by Samuel Wilks when he was president of this Asso- 
ciation; and at the same time, consider again the possibility of setting 
up examinations for Certified Public Statisticians, remembering that 
that label should only be applied to those who have been proved cor- 
petent in method, in subject matter, and in application. 

Finally, I come to the question of reassuring the public that statistics 
which are so widely used in the public interest are competently and 
fairly compiled, and are designed to meet public needs adequately. It 
is time, I believe, for the professiun as a whole to share some responsi- 
bility for these statistics with those who make them. 

Since so many of the important statistical governors and statistics 
used for policy-making purposes in the United States originate in Fed- 
eral statistical agencies, we might first consider how this problem 
could be handled for government statistics. (And here I must emphasize 
that I speak as a private individual, and not as a representative of a 
government agency.) 

I propose that there be created a new United States Statistical Com- 
mission, with responsibility for audit of statistical series, similar to an 
accounting audit, empowered to put a “certified” label on a statistical 
product. It should also be charged with investigation of methods, 
scope, and suitability of statistics, and with making recommendatiois 
for future improvements and developmental work. Such a Commis- 
sion in some respects would be similar to the “Boards of Visitors” 
which some universities have organized to report to the Board of 
Trustees; in others like the Inspector General of the Army; and in 
others like the present Research and Development Board in scientific 
fields. 
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The members of such a Commission might be selected from a panel 
of names suggested by this Association and allied associations dealing 
with important subjects to be considered by the Commission. Primar- 
ily, its membership would be drawn from experts outside government 
who have had actual experience in such operating problems as are 
faced in governmental statistics, in addition to knowledge of methodol- 
ogy. Like the United States Central Statistical Board, created in 1933, 
however, it might also have as members heads of some of the important 
Federal statistical agencies, serving either ex officio or on a consultative 
basis, where questions of scope and method were involved, but not, of 
course, an audit of their statistics. 

It should be a continuing body, serving on occasion as required, but 
with a small full-time staff, and adequate financing, so that our most 
distinguished statisticians, economists, scientists, and other specialists 
could reasonably be expected to devote time and attention to its work. 
Its reports should be made to the highest executive authority, and be 
generally available to the Congress and to the public. 

In making an audit of an important series, such a Commission should 
rely on sample checks to verify the accuracy of the basic data and 
the calculations, and should carefully examine the established pro- 
cedures to see whether they are being followed scrupulously. It should 
devote a good share of its attention to making suggestions for changes 
in procedures, if those currently in vogue are not in accordance with 
the best methods known at the time of review, and to considering 
whether the data are technically adequate for the uses which are being 
made of them. After such an examination, the Commission should be 
in a position to give these statistics a stamp of approval—“Certified 
Public Statistics’—or to withhold that stamp awaiting improvement 
—and say why. I, for one, would welcome the constructive suggestions 
and the heightened public interest that would come from such a review. 
Private statistical series might also be reviewed by the Commission, 
on request. 

To set up such a Coramission will involve both institutional inven- 
tiveness and new ways and means. I cannot venture to propose a work- 
able blueprint in all its details, but I can observe that, for government 
statistics, this is certainly an opportune moment for the creation of 
such a Statistical Commission. This idea, of course, is by no means 
original. A Central Commission of Statistics, quite large in size and 
with somewhat different functions than I am suggesting, has operated 
with considerable success in the Netherlands since the 1890’s, and we 
might learn from their experience. 
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The general goals I have outlined will not be easy to accomplish: 
To lift the level of statistical public relations; to make statistical train- 
ing more realistic; to devise means for establishing the title of ‘“‘certi- 
fied public statistician”; to set up an independent Commission to re- 
view and pass upon our most important statistics, governmental and 
private. 

But our profession has made some first steps in these directions. This 
Association has for many years appointed advisory committees to vari- 
ous governmental agencies, at their request—the Bureau of the Census, 
the Bureau of Labor Statistics, the Bureau of Mines, and others. These 
commit+ ~s have served most usefully in dealing both with program 
content und with statistical techniques. Similar committees should con- 
tinue to be appointed in the future. However, they have performed ad- 
visory functions of a somewhat different character than are intended 
for the proposed new Commission. The Committee on Government 
Statistics and Information Services of 1933, out of which came the 
U.S. Central Statistical Board and its current successor, the Office of 
Statistical Standards, was very valuable. Certainly the level of govern- 
mental statistics has been raised during the past 20 years, in conse- 
quence of its work. The contribution of the report of the Mills Task 
Force to the Hoover Commission, on government statistics, is such an- 
other step. Our Association’s Commission on Statistical Standards was 
created with a much more limited scope, but also in the direction of 
such functions as are proposed for a Statistical Commission. Its sub- 
committees which participated with the Social Science Research Coun- 
cil in reviewing the results of the election polls in 1948 and of the sta- 
tistics underlying the Kinsey Report were both very worth while. This 
Commission has not been very active, for it is voluntary and part-time, 
and has no organic relationship to statistical organizations as such. 

These few steps, however, are evidence of professional recognition of 
some of the problems that are before us. Recently, these problems have 
become much more serious, for statistics have suddenly attained stature 
beyond our hopes—and perhaps beyond our desires. Their importance 
goes beyond our present ability to handle the problems which these 
new uses create with the imagination and authority which the situation 
requires. As a profession, statisticians must organize to meet this chal- 
lenge if statistics are to continue to be administered in the public 
interest. 





DATA FOR MEASURING THE EFFECTIVENESS OF 
PUBLIC INCOME-MAINTENANCE PROGRAMS* 


Jacos FISHER 
Social Security Administration 


The effectiveness of the income-maintenance programs is 
best appraised in relation to program objectives. These may 
be considered as falling under two heads—coverage and bene- 
fits. Measures for appraising success in achieving coverage 
and benefit objectives are to be found in the periodic statistical 
reports and special studies of the administering agencies. Some 
of the measures acquire significance only when placed in rela- 
tion to general social and economic data. Since agencies vary 
in the amount and character of the information they collect 
and publish, and gaps exist in general social and economic 
data, the resultant body of knowledge is incomplete, particu- 
larly with respect to benefit adequacy. 


HE income-maintenance programs with which the present paper 
deals are public programs administered by public authorities, mak- 
ing money payments to individuals or families based either on need 
or on the occurrence of an event against which the individual and often 
his dependents are insured on the basis of past employment or service. 


More specifically, the programs are public assistance and the social 
insurance and related programs, i.e., unemployment insurance, dis- 
ability insurance, old-age insurance, survivors insurance, workmen’s 
compensation, and the compensation and pension programs for vet- 
erans. Industrial pensions, privately purchased accident and health 
policies, annuities and life insurance also help maintain income, but 
are excluded from consideration here because they present research 
problems of a somewhat different kind. 

The effectiveness of the income-maintenance programs is most ap- 
propriately appraised in relation to adequacy of coverage and ade- 
quacy of benefit. ; 

The social insurance programs provide protection for specified 
groups of workers against specified risks. A group is usually identified 
on the basis of industry, occupation or class of worker, or two of these 
categories in combination, and sometimes, in addition, by size of es- 
tablishment, or, as in some workmen’s compensation programs, by the 





* Based on a paper presented at the session on “The Application of Statistics in Appraising the 
Effectiveness of the Income-Maintenance Programs,” Committee on Statistics in the Social Sciences, 
American Statistical Association, Boston, December 28, 1951. Opinions expressed in this paper are those 
of the author and do not necessarily reflect the official views of the Social Security Administration. 
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hazardous nature of the employment. Further, protection under the 
social insurance programs is restricted to persons regularly attached to 
the employments covered, reflecting a general policy to exclude per- 
sons engaged only briefly or intermittently in jobs under the system. 

The coverage of the public assistance programs is defined in the 
eligibility conditions. Need, variously interpreted, is common to all 
of them, but they differ otherwise from program to program with re- 
spect to the age classes covered, the nature and severity of the disabili- 
ties recognized as bringing the applicant within the scope of the 
program, and the parental status of the children who could qualify 
for aid. 

The purpose of the benefit in social insurance and related programs 
is generally, but not always, the partial replacement of a wage loss. 
It is usually designed to cover the “basic” requirements of the bene- 
ficiary and, with the exception of veterans’ benefits, is related in amount 
to previous earnings. The relative emphasis given these goals varies. 
The proportion of wage loss compensated tends to be higher in the 
short-run programs, such as unemployment insurance and temporary 
disability, than in the retirement, survivor, and other long-run pro- 
grams; and it is higher in the programs with larger contribution rates, 
and in programs with a more specialized coverage. 

The purpose of the public assistance payment is to meet individual 
need. Wide divergence exists in the interpretation of need. The basic 
procedure in the determination of the assistance amount, not always 
honored in practice, is the comparison of an individual’s requirements 
and resources, the assistance payment representing the amount needed 
to bring total resources up to the requirements recognized in the 
assistance standard. States vary, however, in their assistance stand- 
ards; and even within a State, differences may occur in the determina- 
tion of need. While the States have made progress toward establishing 
a defined content of requirements to which local administering units 
must conform in the determination of the assistance amount, the iden- 
tification and appraisal of requirements and resources are, to a con- 
siderable extent, still largely discretionary with the local agency. 

What measures are available for gauging the extent to which the 
income-maintenance programs achieve their goals? 

One may turn first to the statistical series maintained by the agen- 
cies administering the programs. Special studies constitute another re- 
source, as does that branch of inquiry concerned with the analysis of 
the legal provisions of the program, the rules and regulations issued by 
the administering agency, and the decisions of appeals bodies. Not to 
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be overlooked, either, are the uses to which data on the general char- 
acteristics of the population, or of selected groups in it, can be put, 
since often what is known about the beneficiaries of a program ac- 
quires significance only when placed in relation to the demographic 
universe to which these individuals belong. Although the inventory 
which follows consists largely of program statistical series, these are 
not the only source of information for measures of program effective- 
ness. 

Since all earners and their dependents are presumably in need of 
income security in old age, and the dependents are all presumably in 
need of survivorship protection, the extent to which the Federal old- 
age and survivors insurance program and the special systems for 
government and railroad workers cover all employments is one meas- 
ure of the coverage of the groups at risk. This may be calculated by 
reference either to aggregate earnings or to persons employed. 

Data on earnings and average number engaged in employiuents 
under the Federal old-age and survivors insurance program, the rail- 
road retirement system, and the Federal, State, and local government 
retirement systems are available on an annual and for some programs 
on a quarterly and monthly basis.’ The relative protection of these 
programs may be measured by the proportion which earnings in covered 
employments represent of total earnings in the Department of Com- 
merce national income series; or by the proportion which the number 
in covered employments constitutes of the total number reported in 
the Census Bureau’s monthly report on the labor force. (Note that not 
all State and local government workers are covered by a retirement 
system.) 

In the Federal old-age and survivors insurance and railroad pro- 
grams, data have been developed as well on all persons in the covered 
employment during some part of the year. Because of the constant 
movement between covered and noncovered employment, more per- 
sons are likely to have some earnings credits under a given program 
during the year than in any one month. During 1950, for instance, 48 
million persons earned wages in employments covered under the old- 
age and survivors insurance program, but average monthly covered 
employment was 35 million. For some purposes such during-the-year 
employment figures are better than the during-the-week figures, and 
may be used in relation to total during-the-year employment as es- 
timated by the Census Bureau and the Social Security Administration. 
Unlike the case in the average monthly employment estimates, 





1 Sources for statistical series referred to are listed in the Note below. 
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during-the-year data cannot be added together because some indi- 
viduals appear in several series. 

Eligibility for retirement or survivor benefits in the old-age and 
survivors insurance and railroad systems is conditioned upon “insured 
status” which measures attachment to covered employment, or years 
of service. A more refined measure of the protection afforded than the 
number of persons in covered employment is the number with insured 
status or with service credits sufficient to make the worker or his sur- 
vivors eligible for benefit in case the event insured against should occur. 
Annual data on the number of earners with insured status are avail- 
able for the old-age and survivors insurance and railroad programs, 
as well as data on the number of years with earnings credits or service. 
These data may be related, for analytical purposes, to data on total 
employment, after adjustment for the number of insured persons who 
have left the labor force but still retain insured status. 

The employment and insured status data and the data on years 
with credited wages or services may be had by age, sex, industry (oc- 
cupation in the railroad retirement program), and for some items, by 
color, making possible analysis of the bearing of these factors on the 
extent of protection. The 1 per cent continuous work history sample 
of the Bureau of Old-Age and Survivors Insurance has yielded several 
studies of the effect of worker mobility and changes in economic con- 
ditions on patterns of covered employment for different age, sex, and 
industry groups. 

Measures of the coverage of extended disability programs may be 
found in the payroll and employment series of the railroad retirement 
system and of Federal, State, and local governments, to which refer- 
ence has already been made. Some earnings and employment series 
are also available to measure the coverage of the temporary disability 
programs in the 4 States with such legislation and in the railroad in- 
dustry. Analysis of the extent to which the disability insurance pro- 
grams cover all persons presumably in need of such protection may 
be made by reference to national earnings and employment data issued 
by National Income Division of the Commerce Department and by the 
Census Bureau. 

The extent of protection against the risk of unemployment is re- 
flected in comparisons which may be made between earners and earn- 
ings in covered employment and total wage and salary workers and 
their earnings. Data on total payrolls and payrolls subject to unem- 
ployment insurance contributions, by State, are available on an annual 
and quarterly basis, while data on the number of workers in covered 





ndi- 


and 
red 
A's 
the 
red 
sur- 
sur, 
ail- 


ice. 
tal 
vho 


ars 
oc- 


the 
ple 
ral 
on- 


ind 


ont 
er- 
ies 
ity 
in- 
ro- 
ay 
ied 
he 





EFFECTIVENESS OF PUBLIC INCOME-MAINTENANCE PROGRAMS 19 


employment during an average pay period are published on a monthly 
and during-the-year basis. An annual series also exists on the number 
eligible for benefits on the basis of covered earnings during the year, 
ie., the number with insured status. Similar data are issued for the 
railroad unemployment insurance program. 

Information on the coverage of workmen’s compensation programs 
is distinctly limited. Until the past decade the emphasis in statistics 
in this field was upon accidents and relatively little attention was paid 
to the insurance protection provided. The Social Security Administra- 
tion annually prepares estimates of covered payrolls for the country 
as a whole, but estimates are availab's for a few individual States only 
and these are in some instances incomplete because limited to par- 
ticular types of insurers (i.e., private carriers, self-insurers, State fund). 
Variations among the States in the employments covered, and in the 
kinds of insurers permitted, and the elective character of the law in 
some States, make for considerable differences in the information col- 
lected and in the comparability, from State to State, of what are pre- 
sumably the same items. 

Another group of coverage questions relates to the extent to which 
the income-maintenance programs are providing income to persons 
who have experienced an income loss, i.e., the nonworking aged, the 
widows and young children of deceased workers, the nonworking dis- 
abled, and the unemployed. 

Estimates of varying degrees of accuracy and currency are available 
for the size of each of these groups. The number of aged nonearners in 
the country may be obtained from the Census Bureau’s monthly report 
on the labor force, while the number and labor force status of widows 
and their! dependent children may be estimated from that Bureau’s 
annual estimates of the family composition and marital status of the 
population. The number of unemployed is readily obtainable from the 
same monthly report on the labor force. Estimates of the number of 
nonearner disabled persons in the working years of life, by age and 
sex and duration of incapacity, have been developed on the basis of 
special studies made in conjunction with the Census Bureau’s monthly 
population survey. 

The extent to which the social insurance and related programs pro- 
vide protection against a given income loss may be measured by relat- 
ing the number of beneficiaries in each of these classes to the total 
number experiencing the loss, after adjustments are made for differ- 
ences in definition. The appropriate program data are available to a 
greater or lesser extent for all programs with the exception of work- 
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men’s compensation (available in some States, however). For some 
programs, the data come with age, sex, race, industry or occupation 
detail. In some of the disability insurance programs there is detail 
also on the kind and duration of disability; and in the temporary dis- 
ability program, the number of spells of illness in the benefit year. 
These details are of importance in the analysis of the coverage effec- 
tiveness of the program for particular groups of workers. The relative 
restrictiveness of the disqualification provisions of State unemploy- 
ment insurance laws is reflected in the ratio of disqualifications to the 
estimated number of spells of unemployment. Data on the dependents 
of beneficiaries in relation to the provisions for dependents in the benefit 
schedule contribute to the appraisal of the completeness of protection 
with respect to this aspect of the program. 

The effective coverage of the public assistance program can be 
measured only by reference to the total number of needy persons in the 
community. The development of measures of need has, however, long 
defied our ingenuity and for good reason. Need is relative and its 
recognition varies from place to place and over time with living stand- 
ards, fiscal resources and other factors. The Department of Commerce 
issues annual estimates of the per capita income of the individual 
States; these are at best only rough measures of State differences in 
need. Distributions of family income by county will be available 
shortly from the 1950 census; when analyzed by family size they 
should provide valuable indicators of the relative number of families, 
county by county and State by State, living below a given income 
standard. Their usefulness in measuring need will be limited however 
by the time lag between the collection and publication of the data, 
differences from place to place in living costs, and by the different 
meanings which will be attached in each county to the significance of 
the data. 

Even if we could agree on a uniform standard for determining need, 
it would take nothing short of a house-to-house canvass to ascertain 
the number of persons in the community with incomes currently below 
the given standard. Public assistance must be applied for: the number 
of needy persons who might qualify but do not apply for public assist- 
ance is not known. What can be measured is the proportion of public 
assistance recipients in the total population, or in such classes as the 
aged. In June 1952, for instance, old-age assistance recipients per 
1,000 persons 65 years and over varied from 45 in the District of Co- 
lumbia to 631 in Louisiana. Recipient rates indicate the extent to 
which population groups are dependent in whole or part upon public 
assistance. Whether they also measure the extent to which need is 
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being met through public assistance will depend, among other things, 
on the scope of the public assistance law, its interpretation by the 
public assistance agency, and public sentiment. 

Monthly data are available on the number of recipients of each 
type of assistance. Recipient rates are published semi-annually; the 
population base in old-age assistance is the number of persons 65 years 
of age and over; in aid to dependent children, the number of children 
under 18 years; and in aid to the blind, in aid to the permanently and 
totally disabled and general assistance, the total population. The com- 
parability of State rates in the aid to the blind and disability assista.ice 
programs is affected by differences of unknown magnitude in the 
relative number of blind and disabled persons, State by State. Special 
studies have developed, in addition, information on the personal and 
family characteristics of recipients, the labor force history of old-age 
assistance recipients, the types and duration of disability encountered 
in the aid to the blind and aid to the permanently and totally dis- 
abled programs and the parental status and reasons for dependency of 
children receiving aid to dependent children. The periodic data and 
the special studies throw valuable light on the influence of differences 
in State eligibility standards and in the incidence of social problems 
in the population, as well as the effect of changes made or proposed in 
social insurance and public assistance legislation upon the relative 
number and kinds of persons receiving public assistance in different 
parts of the country. 

The coverage discussed thus far relates to individual programs. 
What is the net coverage of all the programs concerned with a given 
risk group? How many aged persons, for instance, are receiving income 
from Federal old-age and survivors insurance, the railroad and gov- 
ernment retirement programs, the veterans program, or old-age as- 
sistance? It is, of course, possible to bring together the relevant figures, 
not only for the aged, but for the other risk groups as well, such as the 
survivors, the unemployed, and the disabled. The difficulty with these 
compilations is general lack of information on the number of persons 
receiving income from two or more programs. The size of the groups 
receiving both old-age and survivors insurance and old-age assistance, 
and both old-age and survivors insurance and aid to dependent child- 
ren, has been available on a semi-annual basis since the middle of 
1951, but with this exception, methods have still to be devised for 
eliminating duplication among beneficiary and recipient rolls. In their 
present form the data are useful in indicating the minimum size of the 
group at risk lacking protection. 

Still another approach to net coverage, this time with reference to 
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State differences, is the use of per capita expenditures for aggregate 
social insurance benefits. State differences in this average presumably 
reflect differences in State provision for given risks (unemployment or 
temporary disability, for example); however, they also reflect program 
coverage limitations which have a differential impact on States because 
of differences in the industrial composition of the working population 
(old-age and survivors insurance, for example). State differences in 
the coverage of need may be measured, similarly, by reference to State 
per capital expenditures for public assistance. The latter average is 
affected, of course, by differences in the extent of need, as well as by 
differences in statutory provisions. 

The raw materials for the appraisal of benefit adequacy in the social 
insurance and related programs include data on aggregate payments, 
the number of beneficiaries, the size distribution of benefit payments, 
the wage loss being compensated, the dependents of beneficiaries, the 
proportion of beneficiaries exhausting benefit rights (in the short-term 
programs), the income of beneficiaries, and living costs. 

Aggregate payments are available for all programs; for the work- 
men’s compensation and State and local government retirement pro- 
grams, the aggregates are estimated. With the exception of workmen’s 
compensation, data or estimates are also at hand on the number of 
beneficiaries. This means that for most programs the average benefit 
can be computed. Size distributions, which are a better guide to the 
analysis of benefit adequacy than the average, are published for the 
Federal old-age and survivors insurance system, the Railroad Retire- 
ment Board programs, the Federal Government’s retirement system, 
and State unemployment insurance. Size distributions are not avail- 
able, at least in published form, for other government retirement pro- 
grams, the veterans program, the State temporary disability programs 
and workmen’s compensation. All the short-run programs collect and 
tabulate data on the proportion of beneficiaries exhausting benefit 
rights. 

The other types of data mentioned in the first paragraph of this 
section—the wage loss sustained by beneficiaries, the dependents of 
beneficiaries, the income of beneficiaries from benefits and other 
sources, and living costs for the most common family groupings en- 
countered among beneficiaries, at selected living levels—are not, with 
some exceptions, the by-product of program operations and are ob- 
tainable only through special study. Especially noteworthy in this 
connection are the field surveys conducted by the Social Security 
Administration in the past 10 years of the income and living arrange- 
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ments of old-age and survivors insurance beneficiaries. A number of 
States have made similar studies in the field of unemployment insur- 
ance. With these exceptions there is a general lack of quantitative 
data in the social insurance programs on the relation of the benefit 
amount to family requirements and the place cf the benefit in the family 
income structure. 

Certain derived measures have been found useful in benefit ap- 
praisal which are not dependent upon field study. These include 
average benefit as a per cent of the average wage of all workers in 
covered employment (this is not the same thing as the average wage 
of beneficiaries prior to retirement, disability, or unemployment), 
maximum benefit under the law as a per cent of average wage in 
covered employment, per cent of benefits which are at the maximum, 
the per cent of beneficiaries receiving public assistance, and changes 
over time in the average benefit as compared with changes over time 
in the Consumer Price Index of the Bureau of Labor Statistics. To a 
greater or lesser extent measures of this kind have been developed in 
the Federal old-age and survivors insurance program and in the State 
unemployment insurance programs. They help answer such questions 
as, How much wage loss is compensated by the benefit? Has the benefit 
formula kept pace with changing wage and price levels? Has the insur- 
ance program reduced the need for public assistance? The answers to 
these questions have an important place in legislative considération 
of the need for program amendment. 

The toois for measuring the adequacy of payments in public assist- 
ance are, with some exceptions, those referred to in connection with 
the social insurance and related programs: data on aggregate payments, 
the number of recipients, the size distribution of payments, the income 
of public assistance recipients, the size and characteristics of public 
assistance families, the trend in average payment in relation to the 
trend in the cost of living, and general information on living costs in 
relation to family size and composition. 

Statistical series on aggregate payments, recipients, average pay- 
ment per recipient, and the size distribution of payments are available 
for all five types of public assistance on a monthly or less frequent 
basis. Average payment and size distribution measure differences 
among the States in the amount of assistance available to recipients. 
They reveal little, however, on the relation of the payment to require- 
ments and on the extent of other income in the family. Special studies 
have developed information along these lines for old-age assistance 
and aid to dependent children recipients in selected States, but because 
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resources available to assistance recipients and their treatment by 
assistance agencies change with time, the applicability of the findings 
is limited to the period covered by the studies. Similar information is 
lacking for the other programs. 

Major gaps in knowledge concerning the extent to which the income- 
maintenance programs achieve their objectives are summarized below. 

With respect to coverage, there are no reported data and no regu- 
larly published estimates of the number of employees in employments 
covered by State and local government retirement systems or by 
workmen’s compensation. Coverage and benefit data in three of the 
four State temporary disability insurance programs are incomplete. 
Current estimates of beneficiaries under workmen’s compensation are 
lacking. With some exceptions, information is unavailable on the 
number of beneficiaries and public assistance recipients receiving in- 
come from two or more programs. 

Benefit data are deficient in a number of respects. There is no in- 
formation for the country as a whole on size of benefit in the fields of 
workmen’s compensation and State and local government retirement 
systems. Average payments are available for veterans’ benefits but not 
size distributions of benefits. Generally missing in all social insurance 
and related programs with the exception of Federal old-age and sur- 
vivors insurance and some State unemployment insurance programs 
are any efforts at appraisal of the benefit in relation to other sources 
of income, family requirements, and living costs. With these excep- 
tions, again, no materials have been developed to relate the benefit to 
the wage loss, or to estimate the number of beneficiaries receiving 
public assistance supplementation. In public assistance the chief de- 
ficiencies are in current data on the nonassistance income of recipients 
and organized and analyzed information on the budgetary standards 
used by public assistance agencies for estimating requirements. 

With the exception of public assistance, unemployment insurance, 
and Federal old-age and survivors insurance, small-area data on bene- 
ficiaries and their benefits are generally lacking. 

More difficult to classify because not directly related to program 
operations are deficiencies in the documentation of some of the com- 
monest generalizations in the field of social security. We say the aged 
prefer insurance to assistance. How do we know this? Has anyone 
bothered to find out? To what extent does a contributory system of 
social insurance, with benefits paid as a matter of right and based on 
earnings, preserve “self-reliance and initiative ... dignity and inde- 
pendence,” reward “ambition and effort,” and encourage private 
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savings, as claimed by the House Ways and Means Committee in 
1949?2 What proof do we have that the Bureau of Employment Se- 
curity is correct in stating that unemployment insurance “sustains the 
morale and conserves the skills and standard of living” of the unem- 
ployed?* Is there any objective basis for the claim of the House Ways 
and Means Committee that under a contributory social insurance 
system “productivity is encouraged,” the “Nation’s total production 
is increased,” and that such a system helps “to protect the Nation 
from serious economic maladjustment?”* 

To ask these questions is to call attention to the complete absence of 
attitude research in the income-maintenance field. Such research may 
have no direct bearing on day-to-day program operations, but the 
lack of it is painfully apparent when public assistance comes under 
legislative or newspaper attack, or when the alleged merit or lack of 
merit of the reserve fund in old-age and survivors insurance is publicly 
aired. It would be helpful then to know what the general public thinks 
of the issues involved, what the beneficiaries or recipients think, and 
what the views are of the rank and file staff member working in 
the program, newspaper editors, legislators, and others. 

Reference should be made also to the paucity of bench marks by 
which to measure benefit adequacy. The lacunae here are the primary 
responsibility of other kinds of programs. We lack detailed information 
on the relative importance of different sources of income at various 
income levels, at different ages, and among different family types and 
sizes. Research in this area would put in another light the social and 
political issues posed, for instance, by the contradiction between the 
income a good public assistance standard would provide and what the 
family earner in a low-paid job could earn. 


NOTE 
Published sources for the statistical series referred to in this article 
are listed below. 
Earners, earnings, insurance status, benefit payments, and beneficiaries an wad 
social insurance and related programs: 


Federal Security Agency, Social Security Administration, Social Security Bulle- 
tin (monthly), including Annual Statistical Supplement, published as part 
of the September issue. 

Federal Security Agency, Social Security Administration, Bureau of Old-Age 





2 House of Representatives, Committee on Ways and Means, Report to Accompany H.R. 6000, Re- 
port No. 1300, 81st Congress, Ist session, August 22, 1949, pp. 2, 3. 

3 Department of Labor, Bureau of Employment Security, Unemployment Insurance: Purposes and 
Principles, Dec. 1950, p. 1. 
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and Survivors Insurance, Quarterly Summary of Wage, Employment and 
Benefit Data (Feb., May, Aug., Nov.). . 
—-Handbook of OASI Statistics (annually). 

Department of Labor, Bureau of Employment Security, The Labor Market 
and Employment Security (monthly), including monthly Statistical Supple- 
ment and quarterly supplement Employment and Wages of Workers Covered 
by State Unemployment Insurance Laws by Industry and State. 
—Significant Temporary Disability Data (annually). 

Railroad Retirement Board, The Monthly Review. 

—Annual Report. 

U. 8S. Civil Service Commission, Retirement Report (annually). 

Veterans Administration, Statistical Summary, Claims (monthly). 
—Unemployment Allowance, Claims and Payment Data (monthly). 

—Self Employment Claims and Payment Data, by Agencies (Monthly). 
—Annual Report. 


All State unemployment insurance agencies issue recurrent data periodically, in 
greater or lesser detail. The publications of the California, New Jersey, and 
Rhode Island agencies contain, in addition, data on the temporary disability 
programs in these States. 


Program data of varying degrees of completeness may be found in the reports of 
State workmen’s compensation agencies. Some but by no means all State and 
local government retirement systems include beneficiary data in their annual 
reports. 


Recipients and payments under public assistance programs: 


Federal Security Agency, Social Security Administration, Social Security Bul- 
letin. Recipients and payments appear monthly. Recipient rates are com- 
puted semi-annually and published in the March and October issues. Annual 
per capita expenditures for public assistance are to be found in the March 
issue. 

— Assistance Payments Under State-Federal Programs. Annual release on the 
size distribution of payments for the month of September. 


Periodic data are published by all State public assistance agencies. 
Economic status of selected population groups: 


Federal Security Agency, Social Security Administration, Social Security Bulle- 
tin, June and December. Semi-annual estimates of the number of aged per- 
sons, widows, and paternal orphans with income from employment, social 
insurance and related programs and public assistance. 


General population and economic data useful in developing coverage estimates: 


Department of Commerce, Bureau of the Census, Monthly Report on the Labor 

Force (Current Population Reports, Labor Force, Series P-57). 

—Annual release, usually for the month of April, on marital status, family 
composition, and hcusehold characteristics of the population. (Current 
Population Reports, Population Characteristics, Series P-20.) 

—Bureau of Foreign and Domestic Commerce, Office of Business Economics, 
Survey of Current Business (monthly), for estimates of personal income, by 
source, and State per capita income. 
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and Some special studies referred to in the text: 
Theodore D. Woolsey, “Estimates of Disabling Illness Prevalence in the 
rket United States.” Public Health Monograph No. 4, Public Health Service, 
1952. 
ple- 


Edna C. Wentworth, “Resources of Aged Insurance Beneficiaries: 1951 Na- 
tional Survey,” Social Security Bulletin, August 1952. 

Edna C. Wentworth, “Income of Old-Age and Survivors Insurance Bene- 
ficiaries, 1941 and 1949,” Social Security Bulletin, May 1950. 

Lelia M. Easson, ‘‘Adequacy of the Income of Beneficiaries Under Old-Age 
and Survivors Insurance,” Social Security Bulletin, Feb. 1948. 

The Role of Unemployment Compensation in Maintaining Family Income and 
Expenditure in an Area of Critical Unemployment. National Opinion Re- 
search Center, Chicago, 1951. 

A Report on Benefit Financing and Solvency of the Employment Security Fund in 
Rhode Isiand. Rhode Island Department of Employment Security, Provi- 

, in dence, 1950. 


red 


and Aid to the Blind Recipients With Earnings in September 1950. Public Assistance 
lity Report No. 19, Social Security Administration, 1952. 

Charles E. Hawkins, ‘“‘Old-Age Assistance Recipients: Reasons for Nonentitle- 
s of ment to Old-Age and Survivors Insurance Benefits,” Sociai Security Bulletin, 
and July 1952. 
ual Elizabeth Alling and Agnes Leisy, Aid to Dependent Children in a Postwar Year. 


Social Security Administration, 1950. 
Ruth White and Thomas G. Hutton, Requirements and Incomes of Recipients of 
Old-Age Assistance in 21 States in 1944, Public Assistance Report No. 13, 


Ful- Social Security Administration, 1948. 

ym- “Living Arrangements and Physical Condition of Aged Recipients, 21 States,” 

ual Social Security Bulletin, June 1947. 

rch Ralph G. Hurlin, Sadie Saffian, Carl E. Rice, Causes of Blindness Among Re- 
cipients of Aid to the Blind. Social Security Administration, 1947. 

the Families Receiving Aid to Dependent Children, October 1942, Public Assistance 


Report No. 7, Social Security Administration, 1945. 
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AN APPRAISAL OF THE 1950 CENSUS 
INCOME DATA* 


Herman P. MILLER 
Bureau of the Census 


HE establishment of a program designed specifically to measure 

the quality of the data obtained in the Census is one of the impor- 
tant differences between the 1950 Censuses of Population, Housing, 
and Agriculture and those previously conducted. Several reports on 
the quality cf the data obtained in the Census, which will incorporate 
the findings of the Post Enumeration Survey (a reinterview survey 
conducted by highly trained enumerators after the Census was com- 
pleted) and the record matching studies, are in various stages of com- 
pletion. The purpose of the present article is to provide an interim re- 
port on the reliability of the 1950 Census income data. 

In this paper, comparisons are made between the income size distri- 
butions obtained from preliminary tabulations of the 1950 Census and 
those obtained from the income survey conducted by the Census Bu- 
reau in March 1950 as a supplement to the Current Population Sur- 
vey. In addition, estimates of aggregate income derived from each of 
these surveys are compared with independent estimates prepared by 
the National Income Division of the Department of Commerce.' The 
comparison of the aggregates emphasizes the well established fact that 
data in all field surveys of income are subject to errors of response and 
nonreporting which produce underestimates of the level of income.’ 





* The author is greatly indebted to Edwin D. Goldfield and Gertrude Bancroft of the Bureau of the 
Census who directed most of the work summarized in this paper. Thanks are also due to Leon Paley and 
Robert Stein for their able assistance in all phases of the work. , 

1 All of the 1950 Cenaus data to be discussed were obtained from a preliminary sample of Census 
returns which will be referred to as the Preliminary Sample Tabulations (PST). These data will be 
compared with information obtained in the Census Bureau’s Current Population Survey (CPS) and 
with the income aggregates prepared by the National Income Division of the Department of Com- 
merce (NID). Both the CPS and PST data are based on samples and are therefore subject to sampling 
variation. Figures based on relatively small numbers of cases, as well as small differences between figures, 
should be used with particular care. Estimates of the magnitude of sampling variation for the CPS 
figures may be obtained from the Census Bureau report, Series P-60, No. 7 and corresponding figures 
for the PST data may be found in the report, Series PC-7, No. 2. 

2 Virtually all analyses of income data obtained from household interviews show that this method 
produces an understatement of income. The most complets appraisal of the available data on the size 
distribution of income appears in Volume 13 of Studies in Income and Wealth, published by the National 
Bureau of Economic Research. The articles in this volume by Goldsmith; Mandel; and Wasson, Hur- 
witz, and Schweiger are particularly useful in this connection. The Bureau of Labor Statistics report, 
Family Spending and Saving in Wartime, Bulletin No. 822, contains a detailed appraisal of the income 
data obtained in the Survey of Family Spending and Saving in Wartime as well as brief indications of 
the extent of underreporting of income in other field surveys such as the Consumer Purchases Study, 
1935-1936; Minnesota Income Study, 1938-1939; and the 1940 Census. The only detailed appraisal 
of the wage or salary income data obtained in the 1940 Census appears in an unpublished memorandum 
prepared by the staff of the Census Bureau. 
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The Census Bureau has noted in almost all of its income reports that 
income information obtained in household interviews is largely “based 
on memory rather than on records, and in the majority of instances 
on the memory or knowledge of some one person, usually the wife of 
the family head. The memory factor in data derived from field sur- 
veys of income probably produces underestimates because the tendency 
is to forget minor or irregular sources of income.”* At the same time, 
the data suggest that despite these deficiencies, the ‘nformation ob- 
tained in the field surveys may be sufficiently reliable for most uses. 
In addition to the defects inherent in the household interview ap- 
proach, the 1950 Census had some special problems in regard to the 
collection of income data. For example, the enumerators used in the 
Census were less skillful, less adequately prepared for their jobs, and 
more burdened with procedural details than are the enumerators used 
in the Current Population Survey (CPS). In addition, although the 
“line” schedule used in the 1950 Census was excellent for certain pur- 
poses, it was not well adapted for the collection of family income data.‘ 
Despite these handicaps, the 1950 Census income data show distribu- 
tions quite similar to those obtained in the CPS, which has produced 
a consistent and useful series of national estimates of the distribution 
of consumer income for each year since 1944. The similarity of the 
CPS and Census income data despite the limitations noted above may 
in part merely reflect the fact that within the framework of the collec- 
tion techniques currently employed by the Census Bureau, the differ- 
ences which will be obtained by alternative forms of questioning will 
not be striking. This was demonstrated in one of the 1950 Census pre- 
tests.5 It is also possible that some of the defects inherent in the 1950 





2 U. 8. Bureau of the Census, Current Population Reports—Consumer Income, Series P-60, No. 9, 
“Income of Families and Persons in the United States: 1950,” March 25, 1952, p. 19. 

4 The family income data in the 1950 Census were obtained by asking questions separately (a) for 
the head of the family, and (b) for all relatives of the head as a group. This method does not probe as 
deeply as the CPS income supplements in which income questions are asked individually for each mem- 
ber of the family. 

5 Several different types of income questicns were tested by the Bureau of the Census in the April 
1948 Current Population Survey for the purpose of developing a simplified method of obtaining income 
data, suitable for the 1950 Census. A total of 25,000 households were interviewed by trained enumera- 
tors and the following alternative sets of income questions were asked of systematic subsamples of the 
total sample. 

Type 1.—On one-half of the schedules the enumerator was to ask the specific amount of (a) 
wages and si'* ries; (b) total money income; for each person 14 years and over. 

Type 2.—n one-fourth of the schedules the enumerator was to ask the specific amount of 
total money income for each primary family and for each person not a member of a primary 
family. 

Type 3.—On one-fourth of the schedules the enumerator was to ask the respondent to indicate 
on a card the broad class interval into which the total money income fell for each primary family 
and for each person not a member of a primary family. 


The findings of the field tests of each of the questions are presented in the table below. One of the 
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Census were balanced by better respondent cooperation on the aver- 
age, despite occasional strong individual objection. The legal provisions 
of the Census may have stimulated more accurate reporting on the 
part of respondents. 

TABLE 1 
PST AND CPS ESTIMATES OF NUMBER OF INCOME RECIPIENTS 


AND MEDIAN INCOME, BY TYPE OF INCOME, FOR 
THE UNITED STATES: 1949 











CPS 
PST (Noniastitutional 
Number of income recipients and (Total population 14 years 
median income, by type population old and over, 
of income 14 years old excluding members 
and over) of armed forces 
on post) 
Es 6 6th cas neendsevans 111,926,000 109 ,644 ,000 
Total number of income recipients... .. 72 ,054 ,000 71,768 ,000 
Number of recipients of: 
Wage or salary income.............. 54,974,000 54,912,000 
Self-employment income............ 10,674,000 10,966 ,000 
Income other than earnings......... 18,813 ,000 16,875,000 
Median income: 
oni cinedbsaeraneanes $1,909 $1,814 
Wage or salary income.............. $2 ,032 $2,016 
Self-employment income............ $1,586 $1,039 
Income other than earnings......... $ 478 $ 496 





INCOME OF PERSONS 


1. Comparison of CPS and PST income data for persons. Essentially 
the same techniques were used to obtain personal income data in the 
March 1950 Current Population Survey and in the 1950 Census. There- 
fore, it is of some interest to compare the results. Table 1 presents a 
summary of the numbers of income recipients and medians for each 





striking conclusions derived from these tests was that asking for a global figure on total income for the 
family group (type 2) and asking for information on wages or salary and total income for each family 
member (type 1) produced about the same median income. 


MEDIAN TOTAL MONEY INCOME FOR FAMILIES AND INDIVIDUALS BY TYPE 
OF SCHEDULE, FOR THE UNITED STATES, URBAN AND RURAL: 1947 











Area Type 1 Type 2 Type 3 
SEE ET CO PTET $2,702 $2,741 $2,581 
I iia otha Ae KO Een dee eR Ee Ra 2,961 2,985 2,741 
EE EE CREE OT Ce ee 2,607 2,620 2,544 


I ciiticncectdeccscsuseciucseuanwes 1,760 1,896 1,923 
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type of income. Although the data require adjustment to make them 
comparable with respect to population coverage (see text below), they 
provide a basis for comparing the PST and CPS income distributions 
at the most crucial points. 

With the exception of income other than earnings, PST and CPS ap- 
pear to have reported about the same number of recipients of each type 
of income. The apparent similarity of these estimates is somewhat mis- 
leading inasmuch as the two surveys are not directly comparable with 
respect to population coverage. The PST included about 1.6 million 
institutional inmates,® and about 0.3 million members of the armed 
forces living on military posts who were excluded from CPS. Since 
these persons are in the population base from vhich the PST estimates 
of the numbers of income recipients are made, it is apparent that the 
PST estimates are somewhat overstated as compared with CPS.’ If 
tie institutional inmates and members of the armed forces living on 
post were excluded from the PST weighiting of the data, the total num- 
ber of income recipients in PST would have been estimated to be about 
70.6 million rather than 72.1 million as shown in the table. Experience 
in the reconciliation of other estimates of this type indicates that the 
discrepancy of about 1 million between the CPS and the adjusted 
PST estimate of the number of income recipients is relatively small. 
For example, considerably greater differences in the estimates of the 
number of paid workers during a given year are usually obtained from 
the CPS surveys of work experience conducted in December and the 
income surveys conducted in April. 

It is also apparent from Table 1 that with the exception of income 
from self-employment, the medians obtained for each type of income 
in PST and CPS were about the same. However, a closer look at the 
full distributions by income levels in Table 2 indicates that PST con- 
sistently shows a larger proportion of income recipients in the upper- 
income brackets. The percentage differences do not appear great; how- 
ever, since the upper-income groups possess a large share of the aggre- 
gate income, the small percentage differences shown in the table yield 
relatively large differences in the aggregates. (See page 33 ff. for a de- 
tailed discussion of the aggregate comparisons.) 





6 Income information was very poorly reported for institutional inmates in the 1950 Census. Of the 
1.6 million inmates of institutions 14 years of age and over, an estimated 1.1 million did not report on 
income, 0.3 million reported no income, and 0.2 million reported $1 or more of income. 

7 No adjustment was made for the fact that the PST estimate of the civilian noninstitutional popu- 
lation is 0.4 million greater than the CPS estimate since this represents a difference in the independent 
estimate of the same population rather than a difference in population coverage. The difference between 
these two figures arises from the fact that the population estimate used to inflate the CPS data is a 
Projection of 1940 Census data, whereas the PST estimate represents an inflation of a sample of 1950 
Census returns. 
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The discrepancy of approximately $600 between the PST and CPS 
estimates of median self-employment income is rather large. A full ex- 
planation for this difference is not available at present. Perhaps the 
Census obtained more accurate income data than CPS in farm areas 
because of the use of the detailed farm schedule in the Census of Agri- 
culture, which tended to improve the estimate of net income from farm 
self-employment on the population schedule. This improvement would 
have been reflected in less underreporting of income or in the higher 


TABLE 2 


PST AND CPS DISTRIBUTIONS OF PERSONS 14 YEARS OF AGE AND 
OVER AND OF AGGREGATE MONEY INCOME, BY TOTAL 
MONEY INCOME, FOR THE UNITED STATES: 1949 

















CPS 
Noninsti- 
PST tutional 
population 
Total A 14 years old A 
Totai money income population gere- and over = 
l4yearsold . gate (excluding .&*“° 
income income 
and over members 
of armed 
forces 
on post) 
(thousands) (billions) (thousands) (billions) 
| AN ee 111,926 $171.4 109 ,644 $158.0 
Total with income........... 72 ,054 _- 71,768 _ 
Per cent of those with income. 100.0 100.0 100.0 100.0 
Loss or $1 to $499........... 17.3 1.8 18.9 2.1 
SB Bete BiBiccccccccec. 13.5 4.3 13.8 4.7 
$ 1,000 to $1,499........... 10.9 5.7 10.8 6.1 
$ 1,500 to $1,999........... 10.2 if 10.4 8.3 
$ 2,000 to $2,499........... 11.2 10.6 11.6 11.9 
$ 2,500 to $2,999........... 9.1 10.6 9.5 11.9 
$ 3,000 to $3,499........... 9.0 12.3 8.7 12.8 
$ 3,500 to $3,999. .......... 5.6 8.9 5.4 9.2 
$ 4,000 to $4,499........... 4.0 ce 3.5 6.8 
$ 4,500 to $4,999........... 2.2 4.4 2.0 4.3 
$ 5,000 to $5,999........... 2.9 6.8 2.3 5.7 
$ 6,000 to $6,999........... 1.3 3.6 8.2 3.2 
$ 7,000 to $9,999........... 1.4 5.0 1.0 3.9 
$ 10,000 and over........... 1.4 11.5 1.0 9.1 
Median income............. $1,909 —_ $1,814 — 
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median income of farm families. (See Table 6.) It is also possible that 
the Census income data benefited from the self-enumeration procedure 
which was supposed to have been used in the Census of Agriculture 
in all farm areas except the South; however, in the absence of any firm 
data on the effects of self-enumeration or the extent to which it was 
actually used in the Census of Agriculture, this point can only be a 
matter of conjecture. The only available evidence on this point comes 
from a pretest of the Census conducted in October 1948 in Union 
County, Indiana, and Carroll County, Kentucky, to determine the 
relative merits of self-enumeration and direct-enumeration techniques. 
This pretest showed that in about 60 per cent of the cases where the 
self-enumeration procedure should have been used, the Census enu- 
merators rather than the respondents actually completed the schedules. 
Even where the respondents entered the income information on the 
self-enumeration schedules, a large proportion of the entries were ap- 
parently rounded to hundreds of dollars indicating that the respondents 
probably did not take advantage of the advance distribution of sched- 
ules to refer to records. Moreover, there were no significant differences 
in the medians produced by each procedure. 

2. Comparison of CPS, PST, and NID aggregates. Although the pri- 
mary purpose of the income questions in the 1950 Census and in the 
March 1950 Current Population Survey was to provide a distribution 
of the population by income levels, estimates of the aggregates of each 
type of income can be derived from these data. The comparison of 
these estimates with those prepared by the National Income Division 
of the Department of Commerce provides further information regard- 
ing the reliability of the 1950 Census income data. Such a comparison 
is made in Table 3. The Census aggregates shown in this table largely 
represent estimates derived from the tabulated data and are subject 
to substantial errors of estimation. Moreover, despite the fact that the 
NID aggregates are largely based on record data and may therefore 
be considered more reliable than the field survey data, they are also 
subject to errors of estimation. For these reasons, small differences in 
the results must be interpreted with care. 

The PST and CPS estimates of aggregate income shown in the table 
below were computed by multiplying the estimated number of per- 
sons in each income interval by the average income for that interval. 
For intervals between $500 and $10,000, the midpoint of each interval 
was assumed to be the average; $250 was selected as the average for 
the “Loss or $1 to $499” level, and $20,000 was selected as the average 
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for the $10,000 and over level.* The aggregates were obtained by mul- 
tiplying the frequercies by the average income for each interval. The 
NID aggregates with which the PST and CPS aggregates are compared 
were obtained by adjusting the Personal Income Series of the Depart- 
ment of Commerce to make it more comparable with the definition of 
income and the population coverage of the Census income data.® 


TABLE 3 


PST, CPS, AND NID ESTIMATES OF AGGREGATE INCOME, 
BY TYPE OF INCOME, FOR THE UNITED STATES: 1949 


(Billions) 








CPS 

PST (Noninstitutional 

(Total population 14 

population years old and 
$4 yonen old over, excluding 
ont ovat members of the 

armed forces 

on post) 


NID 
(Total 
population 
14 years old 
and over) 


Type of income 





Total income $171.4* $158 .0* $187.6 
Wage or salary income 124.3 120.0 127.5f 
Self-employment income 31.1 26 .5t 29.3 
Income other than earnings.... 16.6 13.3 30.8 





* The detail by type of income does not add to the total because each of the aggregates was esti- 
mated independently. 

+ Nonfarm self-employment estimate: $19.1 billion; farm self-employment estimate: $7.4 billion. 

t Nonfarm self-employment estimate: $19.4 billion; farm self-employment estimate: $9.9 billion. 


Perhaps the most striking fact shown in the above table is that the 
PST obtained about 90 per cent of the comparable NID estimate of 
aggregate income. This proportion is considerably greater than that 
obtained in any of the CPS income surveys which have been conducted 
to date. The March 1950 CPS obtained only 84 per cent of the NID 
aggregate. 





8 The arithmetic mean for the “tail” of an income distribution having an open-end interval of 
$10,000 or more is generally estimated from a Pareto curve. However, it was found after some investiga- 
tion that although this method yields reasonable estimates for the distributions of families and unrelated 
individuals, it tends to overstate the mean for distributions of persons. An examination of income tax 
returns for 1944-1949 showed that in each of these years the mean adjusted gross income for returns 
with $10,000 or more of adjusted gross income was between $21,000 and $22,000. On the basis of this 
and other evidence it was assumed in deriving the data shown in Tables 2 and 3 that $20,000 was a 
yeasonable approximation of the mean for the open-end interval. 

* A detailed description of the method used to adjust the Personal Income Series for comparability 
with the Census income data is described in a paper by S. F. Goldsmith, “Appraisal of Basic Data Avail- 
able for Constructing Income Size Distributions,” which appears in Studies in Income and Wealth, 
Volume 13 (National Bureau of Economic Research). 
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For the reasons noted in a preceding section, the PST and CPS esti- 
mates of aggregate income shown in the table are not directly com- 
parable. If institutional inmates are excluded from the PST tabula- 
tions, the estimated aggregate income is reduced from $171.4 billion 
to $168.0 billion. An additional adjustment to exclude members of the 
armed forces living on post from the PST data reduces the aggregate 
about $1 billion more to about $167.0 billion. Thus, the actual differ- 


TABLE 4 


PERSONS 14 YEARS OF AGE AND OVER BY TOTAL MONEY 
INCOME IN CPS AND IN THE CENSUS, FOR THE 
UNITED STATES 








Paid workers 





Comparison of CPS and Census All Wage or Self- 
persons salary employed 
workers workers 





Total in sample 2,529 589 

Total reporting on income in CPS 
and Census 2,165 442 

Pe. cent in same income interval 
49 52 31 

Per cent in higher income inter- 
val in CPS 27 25 38 

Per cent in higher income inter- 
val in Census 25 23 31 

Median total money income 

$1,914¢ $2,326 $2,389 $1,797 
$1,849 $2,238 $2,319 $1,543 





* The income intervals used were those shown in Table 2 with the exception that $6,000 to $9,999 
was treated as a single interval. 
t These figures differ from those shown in Table 2 because of sampling variation. 


ence between the PST and CPS aggregates when adjusted for popula- 
tion coverage is about $9 billion. 

3. Income data from CPS-Census Match. About one-fifth of the per- 
sons who reported on income in the March 1950 CPS were also required 
to furnish income data in the 1950 Census. The differences between the 
two sets of reports show the extent of variability of response in these 
surveys. They also indicate some of the concomitant effects of this 
variability on the income distributions. The data in Table 4 are based 
on an analysis of the income reported by 5,700 persons 14 years of age 
and over who were interviewed in both CPS and in the 1950 Census. 











36 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1953 


Of the 4,898 persons who reported on income in both surveys, 61 
per cent were in the same income interval in both surveys, 20 per cent 
were in a higher income interval in CPS, and 18 per cent were in a 
higher income interval in the Census. (About 82 per cent of the per- 
sons reporting on income in both surveys were either in the same in- 
terval or in one higher or lower adjacent interval.) Despite the fact 
that nearly two-fifths of the respondents were reported at different 
income levels in each survey, there was no significant difference be- 
tween the medians or between the distributions by income levels. The 
similarity in the distriLutions largely reflects the fact that income was 
relatively overstated in each survey about as frequently as it was un- 
derstated. These reporting and enumerative errors generally tended 
to cancel, leaving the income distributions relatively unchanged. De- 
spite the similarity of the over-all distributions, there may be impor- 
tant differences between ..1e CPS and Census income distributions for 
certain segments of the population. The relatively small size of the 
CPS-Census Match sample precludes any detailed examination of the 
income effects associated with variability of response at the present 
time. A detailed study of this characteristic wiil be included in the 
analysis of the results of the Post Enumeration Survey. However, the 
analysis of income differences by class of worker (Table 4) and by age, 
sex, and veteran status (Table 5) does not indicate that variability 
of response resulted in any significant differences for these character- 
istics. 

Over four-fifths of the persons reporting no income in the March 
1950 CPS also reported no income for 1949 in the Census, but only 
about one-half of the persons reporting some income in CPS were in 
the same income interval in the Census. Some indication of the extent 
of variation in response among income recipients may be obtained 
from the figures for paid workers in Table 4. About one-half of the wage 
or salary workers were in different income intervals in both surveys; 
however, the medians for these workers do not differ significantly. As 
might be expected, there was considerably greater variation of re- 
sponse for self-employed workers than there was for wage or salary 
workers. About 70 per cent of these workers reported different incomes 
in the two surveys. Even here, however, incomes were underreported 
almost as frequently as they were overreported. 

Table 5 indicates that about three-fourths of the women as com- 
pared with only one-half of the men reported the same income in both 
surveys. However, nearly half of the women in the sample but only 6 
per cent of the men reported no income. Among income recipients, 
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TABLE 5 


CONSISTENCY OF INCOME REPORTING IN CPS AND IN THE 
CENSUS FOR PERSONS 14 YEARS OF AGE AND OVER BY 
SEX, AGE, AND VETERAN STATUS 








Per cent in same 
income interval in CPS 


and the Census 
Median income 








Sex, age, and veteran status Persons Income 
reporting recipients CPS Census 
onincome in CPS 

inCPSand_ and 
Census Census 





BM 6sesévssedwsencens 61 44 $1,914 $1,849 
EIT 49 44 2,432 2,378 
14 to 24, Total............... 56 43 1,054 1,289 
14 to 24, Veteran of World War 
bbs 6bekabecdenesennnuues 48 48 2,185 2,231 
14 to 24, Nonveteran of World 
MAbs dc nedsescencsnade 58 41 722 897 
f, . eeerrr rere. 43 43 2,904 2,850 
25 to 44, Veteran of World War 
Pk dénecdscseseanseduasbane 43 43 2,955 2,918 
25 to 44, Nonveteran of World 
WE Bcc cccsesconcsoences 43 43 2,860 2,773 
Sh snksdcnsscesessseces 46 46 2,644 2,646 
SP GUD oo hccnvcssvcsces 48 44 1,037 1,234 
Ps n.ctevensesecsenses 73 43 910 941 
0 ee 72 47 838 992 
ee ee 73 43 1,271 1 ,293 
GP cndedccccesesnsaceee 72 41 888 915 
SP OE. « « cvcnnsecccacce 69 45 575 497 





about the same proportion (one-half) were in the same income level 
for both sex groups. The medians and distributions by income level in 
CPS and the Census differed by negligible amounts for both males and 
females. 

Within each sex group, the proportion of income recipients who were 
in the same income level in both surveys did not vary by age or veteran 
status. The differences by age and veteran status in the median in- 
comes derived from the two surveys were also insignificant. 


INCOME OF FAMILIES 


1. Comparison of CPS and PST income distributions. The family in- 
come data obtained in the March 1950 CPS represent about the best 
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estimate of the distribution of families by income level which can be 
made within the limitations of the collection techniques thus far used 
by the Census Bureau. For this reason, despite their shortcomings, 
these data provide a standard with which the 1950 Census income data 
can be compared. 

Table 6 shows that there is no significant difference between the 
PST and CPS median income for families within each color and resi- 
dence group.!° The more detailed distributions by income level and 
color in Table 7 indicate a similar correspondence between the results 
obtained in these surveys. The one marked difference between the two 
distributions is the larger proportion of “zero-income” families obtained 
in the PST results. This deficiency in the PST data resulted largely 
from the enumerative and editing procedures used in the 1950 Census 
and is discussed in some detail below. 


TABLE 6 


PST AND CPS ESTIMATES OF MEDIAN TOTAL MONEY INCOME OF 
FAMILIES, FOR THE UNITED STATES, URBAN AND RURAL 














All classes White Nonwhite 
Area 
PST CPS PST CPS PST CPS 
United States...... $3,068 $3,107 $3,216 $3,232 $1,425 $1,650 
0 3,429 3,486 3,581 3,519 1,850 2,084 
Rural nonfarm..... 2,552 2,763 2,711 2,851 1,141 1,240 
Rural farm........ 1,734 1,587 1,936 1,757 733 691 





Although the medians obtained in CPS and PST are substantially 
the same, patterns of small differences which appear in the data may 
be of some significance. Among both white and nonwhite families the 
CPS medians in urban and rural-nonfarm areas are slightly (though 
not significantly) higher than the corresponding PST results; however, 
among rural-farm families PST showed somewhat higher medians than 
CPS. These differences may reflect differences in the procedures used 
in these areas. In nonfarm areas essentially the same procedures were 
used in CPS and Census. Although a’small part of the differences be- 
tween CPS'and PST are attributable’ to the fact that college students 
living away from home were included as family members in CPS but 
not in the Census, it is likely that the higher CPS results in these areas 





10 Unrelated individuals are excluded from this comparison because they are defined somewhat 
differently in CPS and in the Census. For example, college students living away from home are treated 
as family members in CPS and as unrelated individuals in the Census. 








[ 1953 


n be 
used 
ngs, 
lata 


the 
esi- 
and 
ults 
two 
ned 
rely 
sus 


in 











AN APPRAISAL OF THE 1950 CENSUS INCOME DATA 39 


reflect the superior schedule design and better enumeration. The rea- 
sons for the higher PST estimates of farm income have already been 
discussed (see page 32). 

Table 7 indicates that there were perhaps four times as many fami- 
lies without income in PST as in CPS. In absolute terms these families 
numbered 1.6 million in PST as compared with 0.4 million in CPS. The 
relatively large number of families without income in PST is partly ac- 
counted for by the editing procedures used in the 1950 Census. About 
one-third, or 0.5 million families without income in PST, were cases in 
which the family head had no income and income information for other 
relatives was not obtained. Although many of these families probably 
represented cases in which the head had no income but other family 
members did, they were all tabulated as having no income according 
to the income editing rules used in the 1950 Census (see discussion of 


TABLE 7 


PST AND CPS DISTRIBUTIONS OF FAMILIES BY TOTAL MONEY 
INCOME, BY COLOR, FOR THE UNITED STATES: 1949 








Total White Nonwhite 





Total money income 
PST CPS PST CPS PST CPS 





Number (thousands) 38,788 39,193 35,411 35,988 

BE ascnascsedss 100. 100.0 100.0 100.0 
Percent reporting.... 93. . . 
Per cent not reporting 6. 

Total reporting.... 
PPC r ee Tee re 
Loss, or $1 to $499.... 
$ 500to$ 999.... 
$ 1,000 to $1,499... 
$ 1,500 to $1,999.... 
$ 2,000 to $2,499.... 
$ 2,500 to $2,999.... 
$ 3,000 to $3,499.... 10. 
$ 3,500 to $3,999.... 
$ 4,000 to $4,499.... 
$ 4,500 to $4,999.... 
$ 5,000 to $5,999.... 
$ 6,000 to $6,999.... 
$ 7,000 to $9,999.... 
$10,000 and over..... 
Median income...... 
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* Comparable figures not available. 
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effects of Census editing procedures below). At present little is known 
about the remaining 1.1 million “zero-income” families. Nearly half of 
these families probably had no income as defined in the Census, if the 
CPS figure of 0.4 million is taken to be accurate. They may have been 
living on savings, charity, or gifts, or else they were newly created 
families or families in which the sole breadwinner had only recently 
died or left the family. The remainder of these families appear at the 
“zero-income” level for a variety of reasons, such as errors in enumera- 
tion and coding. 

2. Comparison of PST aggregates obtained from the tabulations for 
families and unrelated individuals and for persons. The PST tabulations 
for families and unrelated individuals yielded a considerably lower ag- 
gregate ($154.5 billion)" than the comparable tabulations for persons 
14 years old and over ($168.0 billion). Theoretically, the two estimates 
should have been about the same. The lower estimates derived from 








FOR PERSONS 14 YEARS OF AGE AND OVER 





























If this person is a family head (see definition below) 
ed b; in 194 
EE Ene income received by his relatives in this household 
Last year | Last year,how| Last year, howmuch | Last year | Last year, how | Last year, how much 
(1949), how | much money | money didhereceive | (1949), how | much money | money did his rela- 
much money | did he earn] from interest, divi- | much money | did his rela-| tives in this house- 
did he earn | working in his | dends, veteran’s al- | did his rela-| tives in this | hold receive from 
working as an | own business, | lowances, pensions, | tives in this | householdearn | interest, dividends, 
employee for | professional rents, or other in- | household earn | in own busi- | veteran's allowances, 
wages orsalary? | practice, or | come (aside from | working for | ness, profes-| pensions, rents, or 
(Enter amount farm? earnings)? wages or sal- | sional practice, | other income (aside 
before deduc- | (Enter net in- ary? (Amount | or farm? (Net from earnings)? 
tions for taxes, come) before deduc- income) 
etc.) tions for taxes, | 
LEAVE etc.) LEAVE 
BLANK BLANK 
3la 31b 3le F 32a 32b 326 G 
[_] None (] None (J None (_] None (_] None [] None 
" $ $. a $ $ 
(_] None (J None (] None (_] None [] None (J None 
$. $. $ $ $ $ 






































ll The average for the open-end interval was estimated from a Pareto curve in the tabulations for 





families and unrelated individuals, whereas $20,000 was assumed as the average for the open-end interval 
in the tabulations for persons. If $20,000 had been used as the average in :!\+ family tabulations, the 
aggregate would have been $159.1 billion rather than $154.5 billion. 
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the tabulations for families are largely attributable to the method used 
to collect family income data in the Census. 

Income information in the 1950 Census was obtained from a 20-per 
cent sample of the population. If a person came into the sample, he 
was asked questions 3la-c indicated in the excerpt from the 1950 Cen- 
sus schedule shown below. If the sample person was not a family head, 
questions 32a-c were skipped; however, if he was a family head, ques- 
tions 32a—c were asked in order to obtain income information for the 
entire family group. Although this procedure provided an unbiased 
sample of families and of persons, it increased the possibility of intro- 
ducing reporting errors. 


TABLE 8 


PER CENT OF FAMILIES AT EACH INCOME LEVEL UNDER $10,000 
WITH FAMILY INCOME GREATER THAN HEAD’S INCOME: 
PST AND CPS* 











Total money income PST CPS 
DE EE 6.06655, 50 d:o.soacd nadisesiewarkneeeonaaee 19.3 27.1 
6.6 rch eisdnanssrieennennnnntion 30.7 40.6 
re er ere a 29.6 28.0 
. « ct ckchanebees mame ameaae 33.9 41.3 
UN NEE 6: oisres se rkirei sndvdiei e160 0 erecudr eves raigiecernie 49.1 56.0 
66 660 ee snes sncdesdeainedennne 60.0 70.1 
i. ca cdigncddeesaseadsasebenenens 64.5 72.4 
EE ci pb 6cdereticcndncenadenennnnnn 65.6 74.0 





* PST data showing the number of families with more than one income recipient for the $10,000 
and over interval not available. 

As the collection procedure implies, the family income data are based 
on replies to income questions asked separately for the head of the 
family and for all relatives of the family head as a group, whereas the 
income data for persons are based on replies to income questions asked 
for each person in the sample. The failure to obtain income information 
individually for relatives of family heads probably resulted in under- 
reporting of income for this group. Evidence supporting this view is 
shown in Table 8 where it may be noted that at each income level the 
CPS data have a larger proportion of families in which the family in- 
come was greater than the head’s income. This is equivalent to saying 
that there were more families having income recipients other than the 
head in CPS than in PST. 

3. Effects of Census editing procedures. In the interest of economy, a 
relatively simple editing procedure was used in processing the 1950 
Census family income data. It was assumed that if the income informa- 
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tion was obtained for the family head, the total family income was 
known even when the income information was not obtained for other 
family members. This procedure is different froma the CPS procedure 
where any incompleteness in the returns for a family would result in 
a not reported (NA) classification for the family. It was adopted be- 


TABLE 9 
PER CENT DISTRIBUTION OF FAMILIES BY TOTAL MONEY 
INCOME, USING DIFFERENT NOT REPORTING 
(NA) CRITERIA 








Distribution if families 
in which income 
information was not 
obtained for relatives 
of the head were 
tabulated as NA 


Distribution 
Total money income as 


tabulated 





100. 
88. 
11. 

100 


2 Ss 


Loss or $1 to $499 
$ 500to$ 999 
$ 1,000 to $1,499 
$ 1,500 to $1,999 
$ 2,000 to $2,499 
$ 2,500 to $2,999 
$ 3,000 to $3,499 
$ 3,500 to $3,999 
$ 4,000 to $4,499 
$ 4,500 to $4,999 
$ 5,000 to $5,999 
$ 6,000 to $6,999 
$ 7,000 to $9,999 
$10,000 and over 
Median income 
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cause it was one of the few editing procedures which would permit 
fuller utilization of all information obtained in the Census. Some of the 
pretests indicated that a relatively large proportion of the schedules 
submitted by the Census enumerators would contain blanks in the 
items for relatives of the head which actually represented the absence 
of income rather than the failure to report the amount of income re- 
ceived. Another consideration which led to the adoption of this pro- 
cedure is the fact that in about one-half of all families the head is the 
only income recipient and even where there are other recipients the 
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head is generally the principal recipient. Although other editing pro- 
cedures were considered, only the one finally adopted fitted in as part 
of the general coding and editing scheme used in the Census. 

In general, the editing scheme used in the 1950 Census produced the 
desired results of keeping the NA rate low without seriously distorting 
the income distribution (see Table 9). The NA rate obtained using the 
1950 Census editing procedures was half of what it would have been if 
the alternative procedure had been used of classifying as NA families 
in which no report was made for family members other than the head. 
The median income based on the procedure which was used was about 
$70 lower than the median which would have been obtained from the 
alternative procedure. Nevertheless, this procedure created a down- 
ward bias in the statistics inasmuch as all the missing entries were con- 
verted to zeros, although some of them must have actually represented 
amounts. The approximate magnitude of this bias is refiected in the 
fact that the aggregate computed from the distribution of families and 
individuals by income levels prior to editing was $4 billion higher than 
the aggregate derived from the data as tabulated (after editing). This 
suggests that editing accounts for about one-fourth of the difference 
between the family aggregate and the aggregate for persons. 


SUMMARY 


The 1950 Census data which are currently becoming available pro- 
vide, for the first time, statistics on the size distribution of income for 
each locality in the United States, as well as analytical tabulations for 
the country as a whole which could not be obtained from the sample 
surveys conducted in earlier years. Experienced users of Census data 
have learned to inquire very carefully into the quality of information 
obtained by a mass enumeration of the population. For this reason, an 
investigation of the quality of these data is appropriate at the present 
time. Unfortunately, a full report on the quality of the income data 
will not be available until the results of the Post Enumeration Survey 
and the various record matching projects are completed. However, 
the preliminary investigation described above suggests that the 1950 
Census income data should be satisfactory for most purposes. The two 
major conclusions of the investigation are: (1) the family income data 
obtained in the Census are of comparable level with those obtained in 
the annual income supplements to the Current Population Survey; and 
(2) the income data for persons obtained in the Census are somewhat 
more reliable than the family income data largely because the latter 
were obtained by an inferior collection technique necessitated by the 
use of a line schedule in the 1950 Census. 





DELIMITATION OF ECONOMIC AREAS: STATISTICAL 
CONCEPTIONS IN THE STUDY OF THE SPATIAL 
STRU STURE OF AN ECONOMIC SYSTEM* 


RUTLEDGE VINING 
University of Virginia 


RECENT monograph by D. J. Bogue describes the procedures used 
by the Bureau of the Census in grouping the counties of the 
United States into a new set of 501 areas called State Economic Areas. 
Along with the description of procedure, there is a large map showing 
the Areas, a table listing the counties in each Area, a table giving for 
each Area some 88 pieces of numerical data regarded as indicating the 
non-agricultural characteristics of the Area, and a similar table giving 
for each Area about 75 pieces of numerical data presented as indicating 
the agricultural characteristics of the Area. 

The present discussion will concentrate upon the nature of these 
Areas that are designated as “functional groupings of counties” each 
containing a “distinctive economy.” Interpreted simply as a new set 
of areas for Census reporting intermediate in size between the county 
and the State, the procedure described makes quite good sense. But 
there are overtones in the discussion that will suggest to some readers 
that more was intended, that the work was carried on to such effect 
that the component “economies” or “regional economic units” of this 
nation have been discovered and approximately designated. It is this 
that many readers will look for in such a work, and I believe this seek- 
ing for a “natural” area unit for economic studies is based upon a 
fundamental misunderstanding. 

In my opinion, the spatial structure of a human economy should be 
regarded conceptually as virtually a continuum. As in other studies of 
phenomena having volume or spatial extension, empirical observations 
must be made upon the contents of finite and arbitrary spatial units, 
these empirical observations being viewed as providing an approximate 
conception of what would be viewed were the spatial units made ar- 
bitrarily smaller while the contents were being made more dense. 
From this point of view, I believe the student of the structure, func- 
tioning, and development of an economic system may regard as not 
particularly relevant the criteria used by the Bureau of the Census in 





* Review article on Donald J. Bogue, State Economic Areas—A description of the procedure used in 
making a functional grouping of the counties of the United States, Bureau of the Census, Washington, 
D. C., 1951. This paper was done with the assistance and cooperation of the Bureau of Population and 
Economic Research, University of Virginia. 
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the designation of the Areas. While census data classified for 501 
areas provide more information than if classified for only 48 States, 
less information is provided in this way than if they were classified by 
the 3070 counties. I should be inclined to interpret the choice of the 
number 501 as an economic decision in the sense that presumably the 
cost of making this number 502 was regarded by some responsible 
person or group as being greater than the value of the information 
that would be added; and the particular boundaries selected I believe 
should and can be defended on grounds other than those that may be 
found in the needs of political 2conomists studying how a human 
economy works and develops as an organization. | 

I shall attempt to develop these points, first outlining the nature of 
the purposes that seem to me to give meaning to the criteria described 
as having been used in the designation of the State Economic Areas. 
In the monograph, these purposes are confused, I think, with others 
for which the criteria are not relevant. Accordingly, I shail discuss these 
other purposes in outlining a conception of regional or spatial structure 
in the specification of which such areas as are described in this mono- 
graph would play no essential role. 


I 


Those who were responsible for delimiting the Areas were confronted 
with the problem of selecting a working rule which, together with sta- 
tistical observations, would determine the boundaries of sub-areas into 
which the entire territory of the nation is to be partitioned. An in- 
definitely large number of rules would of course be possible alterna- 
tives to the one finally selected. The selection of a particular rule would 
be made on the basis of how that rule performs in designating the 
county groupings, this performance to be measured in terms of the 
usefulness of the county groupings so designated. The technical spe- 
cialists, who did the staff work and who professionally advised those 
making the final choice, had the problem of analyzing the respective 
performance characteristics of the many possible rules. The decision- 
makers presumably chose that rule which they regarded as having the 
most desirable performance properties. 

The question that I raise now is this: In choosing a “best” rule for 
delimiting the areas, what purposes did the decision-makers have in 
mind for these particular area designations to serve? The rule selected 
would possess optimal properties in its capacity for least expensively 
generating a set of areas that attains predetermined objectives. What 
were these objectives? 
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The primary responsibility of the Bureau of the Census is that of 
providing enumerated information. It gauges the current demands for 
information, anticipates future demands, recommends to a “higher” 
authority budget expenditures upon information-gathering, seeks out 
economical means for obtaining specified information, helps this 
“higher” authority compare the costs of obtaining an increment to 
the total information gathered against an evaluation of the worth of 
this increment. I+ is continually being bombarded with requests for 
factual data from public and private administrative organizations. 
The problem always confronting the Bureau is that of choosing best 
courses of action in meeting the demands made upon it for information 
from the thousands of individual administrative decision-making units. 

From this point of view, the monograph makes good sense in de- 
scribing the procedure adopted by the Bureau in defining its new geo- 
graphic classification: 


In tabulating and publishing data, the Bureau of the Census has found an 
increasing need for a set of areas intermediate in size between counties and 
States. For many statistical purposes, State units are too large and hetero- 
geneous in their composition; whereas in many other instances county units 
are too small and too numerous to be usable. In the 1950 Census of Agricul- 
ture, for example, the use of State economic areas will provide cross-tabula- 
tions in considerable detail of data that previously have been available only 
at the State level. If made for individual counties, such tabulations would be 
‘costly and more detailed than needed for most purposes... . State eco- 
nomic areas are relatively homogeneous subdivisions of States. They consist 
of single counties or groups of counties which have similar economic and 
social characteristics. The boundaries of these areas have been drawn in such 
a way that each State is subdivided into a few parts, with each part having 
certain significant characteristics which distinguish it from the other areas 
which it adjoins. . . . In general, wherever it is not imperative that totals be 
reported for each county, the State economic areas may be used to present a 
concise body of statistics for the entire Nation, by States and their principal 
parts. Considerable savings in publication space, tabulation costs, and cleri- 
cal work can be made through the use of these units instead of counties. 
Also, State economic areas permit the tabulation of sample statistics in much 
more detail than is possible for individual counties. It is for these reasons 
that State economic areas have been established by the Bureau. (p. 1) 


The rule selected by the Bureau for delimiting the areas was sub- 
stantially as follows: 


1. The delimitation was made on the basis of statistical and other 
objective evidence; and homogeneity with respect to certain pre- 
scribed economic and social indices was “the prime criterion in 
judging the quality of the State economic areas delimitation.” 

2. Each area was required to satisfy minimum size requirements. 
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Those areas intended for general tabulations were each required 
to contain at least 100,000 inhabitants. Those intended for agri- 
cultural tabulations were each required to contain at least 10,000 
farms. Those areas designated as Metropolitan state economic 
areas were each required to contain at least one central city of 
not less than 50,000 inhabitants, and the entire Metropolitan 
area, consisting of the county containing the central city and any 
contiguous counties satisfying certain population and employment 
requirements, was required to contain not less than 100,000 in- 
habitants. 

. The areas were required to follow county lines in all cases, 
and all State boundaries were also State economic area bound- 
aries. 

. The procedure consisted of the following steps: (a) A tentative 
delimitation for each State was made, plotting on a county outline 
map for this purpose regional delineations and type of farming 
areas that were available from previous work of others; (b) these 
tentative delimitations were then tested with statistical data in 
accordance with the above requirements regarding size and homo- 
geneity with respect to specified indices on land use, level of 
living, population characteristics, type of farm composition, et 
cetera; (c) the tentative delimitations were thus revised and sub- 


mitted to other agencies for review—departments of agricultural 
economics of each State agricultural college, each of the State 
statisticians of the Crop Reporting Service, and persons or agencies 
representing a non-agricultural interest. 

. The final review and determination of boundaries were made by a 
working committee consisting of members from the Census 
Bureau and from the Bureau of Agricultural Economics. 


I have stated what seems to be the primary problem confronting 
the Bureau—economically meeting demands for enumerated informa- 
tion. Considering this as the main objective, I can readily conceive of 
the Bureau’s staff convincing me that the above rule for delimiting 
the Areas possesses something that could be regarded as optimal prop- 
erties. That is to say, the Bureau’s technical staff could demonstrate 
to my satisfaction that the set of areas generated by the above pro- 
cedure is a “better” set than that obtained, for example, by the different 
procedure used by Rand-MecNally in its partitioning of the nation 
into a set of areas. “Better” refers to the Bureau’s particular purposes 
or objectives, which in general would not be the same as the objectives 
that Rand-McNally’s areas were designed to serve. There now exists 
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‘no systematic method or theoretical basis for analyzing the perform- 
ance properties of a procedure for obtaining a set of areas comparable, 
for example, to the theoretical basis that is available for analyzing a 
sample survey design for obtaining a set of observations upon a popu- 
lation. But the problem is in essence the same. Although there is no 
known way of applying a formal test of the efficiency with which a 
given procedure generates a set of areas satisfying predetermined re- 
quirements, the Bureau nevertheless would seek a way of determining 
a set of areas that is optimum in this sense of being economical. The 
point of view of the Bureau, one may suppose, would be reflected in 
what Morris H. Hansen—who, incidentally, is chairman of the Bureau 
of the Census Committee on Statistical Areas—had to say on another 
occasion and in connection with a different problem. “ . . . The over-all 
test that we [the Bureau of the Census] apply to a sample design is 
that it shall yield the desired information with the reliability required 
at minimum cost; or conversely, that at a given cost it shall yield the 
estimates desired with the maximum reliability.”! The objectives of 
the Bureau in the case of these area designations involve presentation 
as well as estimation; and it is only in such terms of efficient statistical 
estimation and presentation that I am able to understand the size 
requirements and the requirement of homogeneity with seapect t to the 
prescribed indices as having operational meaning. 
II 

Up to now I have interpreted the Bureau’s efforts as having been 
directed to the problem of determining an optimal set of geographic 
areas for the particular purposes which the Bureau serves. This is an 
easily understood conception; but the monograph, in my judgment, 
confuses the discussion by intermixing, along with such paragraphs as 
have been quoted above, statements that imply an entirely different 
idea. At least I get the impression that within the minds of responsible 
members of the staff advising the Bureau there was an idea correspond- 
ing to some kind of “natural area unit” containing something that could 
be called a “distinct economy” that can be discerned or perceived as an 
operating “unit.” Each State was to have been “divided into its prin- 
cipal units” such that “within each unit a distinctive economy pre- 
vails.” This notion would seem to imply that “distinctive economy” is 
a term referring to a natural entity that any competent person can 
recognize when it is held up before him. In accordance with this idea 





1 Hansen, Hurwitz, and Gurney, “Problems and Methods of the Sample Survey of Business,” 
Journal of the American Statistical Association, 41 (1946), p. 174. 
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of “natural area unit,” there would be, not an optimum set of areas 
for a specified purpose among an indefinitely large number of possible 
sets of areas, but some certain and existent number of distinct quasi- 
organic entities; and the procedure that was sought by the Bureau, one 
may be led to suppose, was a procedure so designed as best to facilitate 
the discerning and setting apart of these entities. The 501 that were 
set apart would, correspondingly, not be an estimate of the “best” 
number but rather an estimate of the number. “Because Census data 
are the sole or principal source of information about many aspects of 
social and economic life,” the monograph states, “there has been an 
increased demand that the Bureau of the Census undertake to provide 
more detailed information about the major socio-economic areas in 
each of the States. The State economic areas have been established in 
an effort better to serve that growing body of analysts ... “who are 
conducting their work in terms of “functionally defined units of area.” 

The term “functional area” or “functional unit” is used at a number 
of places to describe the areas, but there is no explanation of what 
meaning the word “functional” is supposed to convey. When it is stated 
in the monograph that within each of the area units being sought 
there prevails a “distinctive economy,” an attempt is made to define 
this latter term. “The term ‘economy’ is used here in its broadest 
sense; it refers to the total adjustment which the population of an 

“area has made to a particular combination of natural resources and 
other environmental factors.” 

This statement, however, evidently has no operational content. In 
an empirical field, a definition is supposed to serve the purpose of 
setting apart those attributes by which that being defined may be 
recognized and classified under the name assigned to it. The quoted 
statement offers the following instructions: When confronted witha 
geographic area containing (1) a population of human beings and (2) a 
combination of natural resources and other environmental factors, the 
research worker is to inquire into whether or not (1) has adjusted itself 
to (2); if adjustment is observed to have taken place, then the geo- 
graphic area in question is to be classified as containing a “distinct 
economy.” But in accordance with these requirements any area whatso- 
ever inhabited by human beings engaged in exploiting “natural re- 
sources and other environmental factors” would qualify as an “area 
unit containing a distinctive economy,” for ordinarily the expression 
“engaged in exploiting” will do well enough as an equivalent for “ad- 
justment” in this context. 

No special meaning is explicitly assigned to “total adjustment,” but 
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one gets the impression that in some way the idea of “homogeneity” 
is associated with the ideas of “total adjustment” and “functional 
grouping.” The test for “homogeneity” that was improvised by the 
Bureau involved the following steps: 
The indices [intended as measures of “land use”, of the allocation of employ- 
ment among broad classes of industry, of the “level of living”, of population 
characteristics, of “type of farm composition”, of “value of farm products” 
by type of product, of “crops and yields”, of “livestock production”, etc.] for 
all counties of a State were listed on worksheets, grouped according to the 
tentative delimitation. Sums of indices for each grouping were taken, and 
the mean value of each index for each group was computed. The index 
values for each county were compared with the mean values for the two or 
more groups in which the county could be placed. Wherever there was indi- 
cation that a county would deviate less from its group mean for any index 
if it were moved to an adjoining group, the fact was noted. ... When all 
indices for a county had been compared . . . the notations of deviation were 
studied to determine whether or not jointly they indicated that the county 
should be reclassified. .. Because several index values were kt ing em- 
ployed to determine homogeneity, the indices were seldom unanimous in 
placing a county either in one grouping or in another. The indices indicating 
change and those indicating no change were considered for their relative 
importance in characterizing the two areas being separated. 


This is a plausible procedure for stratifying a population; and it is 
easily understood how the strata obtained may be found to have useful 
properties—for example as indicated above, in the improvement of the 
efficiency with which certain population parameters are estimated or 
of the effectiveness with which information is classified and presented. 
But I see no grounds for calling them by names implying more than the 
term “stratum” implies. I can see no more grounds for calling the strata 
obtained by this procedure “functional groupings” or “distinctive econ- 
omies” than if the stratification had been made in accordance with 
some kind of criterion of “heterogeneity.” No explanation is given 
which would convey an understanding of how the “adjustments” 
made by the population within any one of these strata may be regarded 
as different in essence from the “adjustments” made by the population 
within any other designated area. 

This seeking to meet the “needs” of political economists by provid- 
ing them with “functionally defined units of area” is based upon what 
I regard as a misunderstanding of the needs of political economists. 
The need for these area units was described in the monograph as 
follows (italics added): 


Numerous problems of maladjustment to the environment, of improper use of 
resources, of recurring or chronic conditions of unemployment or under- 
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employment in certain of these areas [‘that have emerged as a result of the 
adjustment of the population to the physical environment’] tend now to be 
regarded as problems of the Nation, and are studied by a variety of special- 
ists. Increased awareness of these conditions has accentuated the need for 
more knowledge about local areas. The growing concern about problems of 
these types has fostered a general tendency for analysts to descend below the 
level of State statistics, and to conduct much of their work in terms of 
smaller, more functionally defined units of area. 


In my opinion, the important needs of the political economist studying 
the structure and functioning of an economic system are illustrated by 
the questions that this statement begs. “Areas” do not “emerge” with 
this “adjustment” process. The areas have been where they are all 
along. Something else emerges, and while it is spoken of as an economy 
or an economic organization, research workers have yet to arrive at a 
generally satisfying mode of empirically observing and concretely de- 
scribing its structure and the process by which it is supposed to develop. 
By what criteria do observers “become aware” of the “impr~perness” 
or the “maladjustment” characterizing the performance of an economic 
system? Certainly not by the mere enumeration of the contents of geo- 
graphic areas however defined, for these terms imply a comparison 
and express an evaluation. To perceive “maladjustment” or “improper- 
ness” or “underness” or “overness” in an observed situation is to pos- 
sess the capacity for perceiving and making known by well-defined 
criteria the qualities characterizing states of “adjustment” and of 
“properness” in the performance of the organization of economic units. 

Work toward establishing this capacity can hardly be said to have 
begun. There are, to be sure, judgments that are daily made to the 
effect that per capita incomes in certain areas are too low or that indus- 
trial development of certain areas is lagging or that the structures of 
the economies of certain areas are oul of balance. But upon inquiry into 
the bases of these judgments, one must agree, I believe, that they issue 
from our passions and sympathies, and one will search in vain for 
analytical criteria that will stand critical investigation. There are 
analytical criteria in economic theory by which the consistency of an 
argument in a social discussion may be judged. But no positive theory 
now exists which will account for the structural and operating fea- 
tures and developmental processes of a human economy as it performs 
under acceptable conditions of freedom for individual units and which 
would assist participants in a social discussion in evaluating how well 
or how poorly an existing economy is performing and in making explicit 
the meaning of some such notion as “significant divergence” from 
“proper performance.” The work at this stage is primarily experi- 
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mental in the efforts that are being made to discover useful ways of 
describing the structural features and the operating and growth char- 
acteristics of an economic system. The Bureau contributes to this 
work by responding to requests for information. But it cannot inform 
a research worker in regard to the set of areas he should obtain observa- 
tions upon any more than it is in a position to specify for administrative 
agencies what set of areas is “best” for their respective administrative 
purposes. For a given research worker dealing with some given problem 
of description or of hypothesis-testing there may exist an optimum set 
of areas upon which observations are to be made for this piece of re- 
search. There exists no unique or single set of “natural area units” as 
distinct operating entities and component parts of a human economy. 


III 


The implication of the last statement is that Lhe eco: »my, in its 
spatia’ aspects of structure and functioning, is to be regarded as a con- 
tinuum. It occupies area but it is not to be identified with the area 
occupied, and it has an objective form of its own as pattern and struc- 
ture. The economy is an organization of economizing units. The be- 
havior to be studied is not that of the units but rather of the system 
of units. As 2 population system, the organization is characterized by 
structural and operating features, and these pertain to the system per 
se and not to any particular set of mortal economizing units that may 
exist at any specified point in time. 

A statistical description of the form and pattern assumed by this 
system may be made, I believe, without reference to any particular 
set of geographic sub-areas. For this purpose I would be inclined to 
develop certain of the ideas of Walter Christaller. Christaller studied 
what he looked upon as asystem of central places.? The central places 
are the cities within an economy, and he classified cities into discrete 
types, each type having its characteristic function to perform within 
the system. The pattern that he described was that formed by the 
spatial orientation of these central places of various types. His principal 
contribution, I think, lies not in his description of what he thought he 
saw as a particular pattern but in the steps that he took towards a 
specification of the attributes of spatial pattern in general. His attri- 
butes of pattern include the following: operationally definable types or 
classes of population clusterings or central places; a natural ordering or 
gradation of these central places into a hierarchical system of centers; 





2 Die Zentralen Orte in Suddeutechland, Jena, 1933. 
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numerical regularity within this system with respect to the frequency 
of central places of a given type or size; a numerically specifiable spatial 
orientation of the different types of central places. 

More recently, Bogue, the author of the monograph reviewed above, 
studied what he regarded as the structure of metropolitan communi- 
ties.? He did not present data for individual metropolitan communities 
but only average figures for two large classes of communities for each 
of several parts of the nation. As distance from center is increased, his 
data describe population density as sloping downward in approximate 
conformance with a definite rule. Whereas Christaller’s conception 
calls to mind a configuration of population density peaks, Bogue’s 
study suggests a filling in around each peak to form a density configura- 
tion that may be conceptually thought of as specifiable in terms of a 
density function. Thus, the statistical conception of a density function 
may be employed in this description, and Christaller’s notion of a 
system of central places may be modified into the idea of a spatial 
density configuration of economic units. This latter would be observed 
if the locations of the family and individual units at a point in time 
were to be plotted as coordinates in space, and Christaller’s central 
places would be represented by the points within the area at which 
localized peak densities are observed. 

From this view, the United States, or any other large populated 
area, appears as an interconnected system of central places. The 
density of the economic units is at a local peak within each of these 
central places, but it does not fall off abruptly at anything that can be 
called an edge or limit of these cities. It declines systematically with 
distance from the center, and for the major centers the decline con- 
tinues for a hundred or more miles out, until an area of dominance of 
an adjacent major center is reached; and then population density rises 
relatively smoothly to another peak. Passing along connecting links 
between the major centers, one sees rises to and falls from smaller 
density peaks at sub-centers. Each major center appears as a hub 
with spokes, so to speak, extending to other centers. The sub-centers 
associated with a given major center may be seen as lower order hubs 
with extending spokes. That which appears as a principal central place 
from the point of view of a particular set of centers may appear as a 
subsidiary central place from the point of view of a larger set of places; 
and the hierarchy of centers scales up finally to a nucleus for the entire 
system. 





8 The Structure of the Metropolitan Community—A Study of Dominance and Subdominance, Univer- 
sity of Michigan, 1949. 
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The connections that link the density peaks into a system consist of 
the transport and communications networks, and the entire configura- 
tion may be visualized without the aid of any sub-area designations 
regarded as economic regions. Figure I affords a rough visual impression 
of the configuration covering this nation’s area. The densities shown 
are on a scale such that only the major centers are indicated and refer 
to highway traffic rather than to the economic units within the nation. 
The two densities would surely be correlated, however, and this figure 
illustrates what I have in mind by a spatial density configuration of 
economic units. 

The spacing and clustering of economic units constitutes a spacing 
and clustering of complementing economic activities. The density 
peaks represent clusterings of certain types of economic activities, and 
these central places are formed into systems and sub-systems of central 
places. This systematic formation may be rationalized and described 
in common sense terms as follows. Some of the economic units are 
engaged in activities the locations of which are determined by the 
locations of natural resources and thus are dispersed over space. The 
expectation is that each area of substantial size will be endowed with 
natural resources ranging from those of a type any member of which 
is found in only a very few places to those of a type the members of 
which are widely distributed among most areas. All areas contain 
alike these latter resources, and each contains some one or more of the 
rarely found resources. The dispersed units are supplied and serviced 
from central places. In the smallest central places the servicing and 
supplying operations involve activities the products of which have v4e 
smallest market range. The type of value added in these centers 1s 
such that there is a relatively large rate of consumption per consuming 
unit and also such that the rate of production of a tolerably efficient 
producing unit is relatively small. Among the units serviced and sup- 
plied are those whose product specializations may consist of outputs 
of that type of resources found only in a few places. The mean dis- 
tance moved from point of origin to destination by the value-added 
embodied in the products of these units is relatively great. From this 
extreme, the mean distances covered by resources and products of 
other types range down to nearly zero; and in these small centers the 
proportion of units of value-added shipped that terminate at some 
given distance declines sharply as distance is increased. 

Several of these small central places are grouped into a low-order 
system of places centered upon a somewhat larger place in which are 
found, in addition to the smallest-range central goods and services, 
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other central goods and services with a somewhat larger market range, 
The rate of consumption of these latter central goods per consuming 
unit will be smaller than that of the goods of the smaller range so that a 
larger number of consuming units will be required to take a given rate 
of output. Or the size of the efficient producing unit may be greater than 
that of those typical of the smaller center so that more consuming units 
may be required in order to keep a producing unit going than are 
available within the range of the smallest central place. Also, in this 
second sized central place the chance is better than for the smallest 
place that there will be located there some form of processing requiring 
an assembling of resources not all of which are found within the range 
of the smallest order of place. Thus, in these second-order centers there 
are, as in the small centers, economic units engaged in activities pro- 
ducing goods and services with market ranges varying from the rela- 
tively large to virtually zero, and for these centers also the proportion 
of value-added shipped that terminates at some given distance de- 


, clines sharply as distance is increased. The two types of centers may 
have similarly shaped distance distributions, and in the short-distance 


range of the scale the types of products and services are the same in the 
two distributions. In the longer distance range of the scale the second- 
order centers will tend to show a somewhat higher degree of product 
diversification. 

Each of the second-order central places has its subsidiary elementary 
centers about it, and this constitutes the lowest-order system of cen- 
ters. Several of these are grouped about a third sized place to form a 
second-order system of centers. This third-order central place will in- 
clude, in addition to the activities of units providing all the types of 
central goods and services found in the smaller central places spaced 
around it, economic activities providing central goods and services 
with ranges reaching out over the entire system of which it is a nucleus 
or center. All that was said in comparing the respective distributions 
of market ranges for the two lower-order types of central places may 
now be said in regard to this third-order place as compared with the 
next place below it in the ordering of types of places. A place of this 
third order is more likely to contain processing activities involving an 
assembling of resources not found in all areas, and the tendency will be 
for this assembling to draw from a larger area than is the case for the 
smaller centers. The products sent the longer distances from these 
larger centers will typically show a greater diversification than those 
sent from the smaller centers. Several systems of this higher order are 
oriented with respect to a still larger central place, and so on; and a 
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comparison of the situation to be found in any given order of place 
with that characterizing the next lower order of place would involve 
the same kind of generalization. A central place of any given order is a 
source of central goods and services of all the types available in any 
of the lower-order central places within the system of which it is a 
center. In addition it is a source of central goods the market ranges of 
which cover these centers and areas included with this system. And 
in addition to this it is an assembling and processing point for resources 
and products peculiar to the territory covered by its system of centers 
and destined to be shipped to the central places of other systems of 
centers. 

In the above description, the central places are regarded as points 
of dispersion and points of absorption of economic flows. From each of 
these centers, value and physical products and services are continually 
moving outward, counterbalancing the inward flows of value and 
physical products. For a given center the destination points would form 
a density configuration and likewise: would the origination points 
fur the flows coming into the center. Thus, a second kind of density 
configuration has been introduced in the description of the structure 
of the system. The pattern formed by the spatial distribution of the 
economic units is specified b, the first kind. The second kind describes 
the spatial dispersion of (a) the destinations of the economic flows 
emanating from any given concentration of units and (b) the origina- 
tions of the economic flows terminating «>i any given concentra- 
tion of units. Suppose that for a given central place and time period 
the destination were known for each product or value addition pro- 
duced by the units served by the central place. Then, the distance from 
the center to the destination for each unit of value-added would be a 
variate for which a frequency distribution could be formed. This dis- 
tribution would describe the density configuration of the destinations 
of the flows originating in the center; and in a similar way there could 
be constructed a distribution describing the density configuration of 
the originations of the flows terminating in the center. 

As an illustration of what I have in mind as this second kind of den- 
sity function, I am including here Charts‘ 1-7. These empirical distribu- 
tions conform only roughly to the idea. They refer to carload rail 

4 These charts were prepared by Jere Clark with whom I am working in the study of these data 
and who is preparing a detailed study of economic flow interconnections as these are indicated by the 
1.C.C. Waybill data. This work is being done under the auspices of the Bureau of Population and Eco- 
nomic Research of the University of Virginia. This Bureau is also making possible a study by Carl 
Madden pertaining to the development over time of the system of central places represented by the 


cities of this nation and a study by Carlyle Baskin of the works of Walter Christaller as an illustration 
of empirical work dealing with the spatial structure of a population system. 
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Cart la. Relative Frequency Polygon Showing the 252 Commodity 
Groups, by Average Distance Hauled by Rail, United States, First Quarter, 
1947, 1948, 1949, 1950. 
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Source: Data derived from I.C.C. Carload Waybill Analyses, Statements No. 517 (1949) and 
No. 5130 (1950). 


Cuart 1b. Cumulative Relative Frequency Distribution of the 252 I.C.C. 
Commodity Groups by Distance Hauled by Rail, United States, First Quarter 
1947, 1948, 1949, 1950. 
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Source: Data derived from I.C.C. Carload Waybill Analyses, Statements No. 517 (1949) and No. 
5130 (1950). 








1953 


dity 
rter, 


3300 


and 


a 94 
‘ter 


000 


10. 





DELIMITATION OF ECONOMIC AREAS 59 


tonnage and not to value-added in all products and services, and the 
originations and terminations are classified by State and not by central 
place. But they are illustrative nevertheless. Charts la and 1b show 
the distributions for a succession of years of the average distances 
hauled for some 252 commodity groups. The distributions approx- 
imately conform to the same logarithmic normal distribution. It can 
be shown that from year to year a commodity group will be found 
within the same general neighborhood of the distance scale, although 
over longer periods there may be expected systematic shifts of com- 
modity groups along the scale that do not necessarily alter the shape 
of the entire distribution. This chart represents a classification of com- 
modity groups and indicates a degree of stability in the typing of 
commodity groups in accordance with average distance hauled; but 


Cuart 2. Cumulative Relative Frequency Distribution of Cars of Rail 
Freight Classified by Distance Hauled, All Commodities, United States, 1949, 
and lst Quarter of 1947. 


(Plotted on Logarithmic Probability Paper) 
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Source: I.C.C. Carload Waybill Analyses, Statements No. 4822 (1947) and No. 5111 (1949). 
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it does not represent the idea of the distance density function referred 
to in the previous parayraph. Chart 2 presents a different arrangement 
of the data, and the distributions shown are in rough conformance 
with this idea of distance density function. It is for the United States 
as a whole and shows the proportion of total tonnage terminating 
within any specified distance of the point of origin. The plotting is 
done in such a way that a straight line indicates a logarithmic normal 


Cuart 3. Cumulative Relative Frequency Distribution of Carloads of Rail 
Freight Originated in Alabama and Virginia, by Distance Hauled, Manufac- 
tures and Miscellaneous, 1949. 
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Source: I.C.C. Carload Waybill Analyses, Statement No. 5110 (1949). 


distribution and it may be seen that the distribution approximately 
conforms to the same straight line for the periods shown. Two parame- 
ters being sufficient for specifying a straight line, this would mean that 
a knowledge of two parameters is sufficient for the description of the 
distance distribution of tonnage hauled by rail; and these parameters 
were stable during the period shown. 

When the national distribution is broken into the component State 
distributions, these State distributions are found to be characterized 
by essentially the same commodity groups within one end of the scale 
but by wide divergency in the other end, each State typically having 
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a commodity specialization in the long distance end. Distributions for 
Virginia and Alabama are shown in Charts 3, 4, 5, 6, and 7. These 
charts indicate for given origination points the proportion of traffic 
terminating within any given distance of those points, the origination 
points being grouped by States. Various ways of plotting have been 
employed. Charts 3 and 4 illustrate the similarity between the dis- 
tributions of the two States. Chart 5, plotted on logarithmic probability 
paper, shows the similarity between the distribution for a State and 


Cuart 4. Cumulative Relative Frequency Distribution of Carloads of Rail 
Freight Originated in Alabama and Virginia, by Distance Hauled, Manufactures 
and Miscellaneous, 1948. 
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Source: I.C.C. Carload Waybill Analyses, Statement No. 5038 (1948). 


that for the Nation. Charts 6 and 7 show logarithmic normal fits to the 
distributions of a State, one distribution for the terminations of traffic 
originating in Virginia and one for the originations of traffic terminat- 
ing in Virginia. Six hundred miles is apparently far enough to include 
the bulk of the termination points for rail traffic originating at points 
within a given State and of the origination points for rail traffic ter- 
minating at points within a given State. But these distributions refer 
only to long distance traffic, and a very much shorter distance would be 
sufficient for including the same proportion of all the value-added, 
originating or terminating. The relatively stable flow of traffic that 
can be observed over long distances is stable by virtue of a continual 
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Cuart 5. Cumulative Relative Frequency Distribution of Carloads of Rail 
Freight Originated in 1948 and in the United States in 3rd Quarter, 1947, by 
Distance Hauled, All Commodities. 


(Plotted on Logarithmic Probability Paper) 
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Source: I.C.C. Carload Waypbill Analyses, Statements No. 4822 (1947) and No. 5038 (1948). 


adding-to and dropping-out process, the additions on the average going 
relatively short distances before dropping out. 

This mode of describing the spatial structure of an economic system 
does not make essential use of sub-area designations as analytical 
concepts. When a particular place is under consideration, the essential 
matter is its orientation, type, and role within the density configuration 
or system of which it is a part. Its flow connections are shown by its 
distance distributions—the terminations of its physical and financial 
flow originations and the originations of its physical and financial 
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Cuart 6. Percentage Frequency Distribution of Observed Number of Cars 
of Freight Originating in Virginia Shipped Given Distances, Compared with the 
Number Indicated by the Logarithmic Normal Distribution, by Distance Inter- 
vals, Manufactures and Miscellanecus, 1948. 
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Source: Data derived from I.C.C. Carload Waybill Analyses, Statement No. 5038 (1948). 


Cuart 7. Percentage Frequency Distribution of Observed Number of Cars 
of Freight Terminating in Virginia Shipped Given Distances, Compared with the 
Number Indicated by the Logarithmic Normal Distribution, by Distance Inter- 
vals, Manufactures and Miscellaneous, 1948. 
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flow terminations. The basic idea in this description is that of a density 
function; and that part of theoretical statistics which analyzes the 
conditions that generate distributions having specified forms is rele- 
vant in considerations directed to explaining the existence of the par- 
ticular forms of the distributions observed and the processes by which 
these forms have developed. 

There would seem to me to be little basis for doubt regarding the pos- 
sibility of constructing a mechanical model that would distribute and 
from period to period redistribute individual entities among the com- 
ponent small squares of a large partitioned area in such a way as to 
illustrate certain aspects of the time process by which economic units 
spread themselves in the course of many decades over an area such as 
this Nation’s. In my opinion the model in conception could be made 
more realistic than the mechanical models that have been constructed 
in the study of economic time series phenomena. There are specialists, 
of course, who can analyze a situation of this sort to the extent that 
the problem is adequately formulated. The end result would give the 
expected numbers of elementary areas containing 0, 1, 2, 3,---, 
individuals after the passage of a time interval of some given length, 
and the expected distances between density peaks of specified sizes 
would be indicated. With the continuing passage of time, the system 
would develop at varying rates in its different parts. The analysis of 
the process would throw light upon the expected quantitative charac- 
ter of these divergencies in growth rates. The contribution of this 
would seem to be obvious; for although much of social discussion deals 
with comparisons and evaluations of regional (or national) growth 
rates, there is at present an absolute lack of any analytical standard 
by which to ju:'ge critically the observed divergencies in growth rates. 
“Normal” divergency in growth rates and “normal” concentration of 
population—concepts implicit in the comparisons made in many in- 
stances of social discussion—refer to the “expected” divergency and the 
“expected” spacing of individuals in a system that is functioning 
“normally.” Analytical knowledge of these latter has yet to be pro- 
vided; and evaluative comparisons that implicitly assume that in a 
“normally” working system the different geographic parts of an eco- 
nomic system should be developing at approximately the same rates 
or that there is something “unhealthy” about the functioning of a 
system in which some parts are showing negative growth rates ob- 
viously beg the question of the attributes of a “normally” working 
system. 
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LABOR PRODUCTIVITY IN THE SOVIET UNION* 


Irvine H. Srecen 
Twentieth Century Fund, Washington 


I 


HE productivity of labor, especially in “industry,”! is a subject of 
, eon interest in USSR. Its measure is described in an official 
handbook as the “most important national economic index,”* and is 
projected with production goals in the directives setting forth the 
five-year plans. The term is probably mentioned more often in Soviet 
popular and technical literature than in our own.* The concept plays 
a vital role in the “scientific” socialist theory of economic evolution 
and in the advertised Soviet strategy for achieving the twin objectives 
of full communism and the end of “capitalist encirclement.” Lenin’s 
statement of this role, repeated by Stalin in 1929, has since become a 
Soviet commonplace: 

In the last analysis, productivity of labor is the most important, the princi- 

pal thing, for the victory of the new social system. Capitalism created a pro- 

ductivity unknown under serfdom. Capitalism can be utterly vanquished, 


and will be utterly vanquished, by the fact that socialism creates a new and 
much higher productivity of labor.® 


With the adoption of comprehensive “planning” in 1928, a systematic 
program was inaugurated to supply the material basis for achievement 
of the long-range Soviet objectives. Agriculture has been collectivized, 
and many of its operations mechanized. Industry has been rebuilt, 
expanded, electrified, and otherwise modernized. The urban labor force 
has been enlarged by streams of “surplus” rural population, and its 
skill has been progressively elevated by means of on-the-job and voca- 





* Revision of papers presented at the December 1951 meeting of the American Statistical Associa- 
tion and at the April 1952 meeting of the New York Chariter. The author is grateful to P. R. Lever, who 
provided translations of Russian materials in the course of a study conducted in 1949-51 for the Johns 
Hopkins University Operations Research Office; and to 8. Fabricant, W. Galenson, and A. Gerschenkron 
for their critical comments on the original paper. 

1 Includes principally manufacturing, mining, and electric power supply. 

2 Slovar Spravochnik po Sotsialno-Ek icheskoi Statistike (Dictionary-Handbook for Socio- 
economic Statistics), 1948, p. 397. 

3 There is no justification for concluding, however, that productivity is relatively “underemphasized” 
in U. S. (See S. E. Harris, Economic Planning, New York, 1949, pp. 49-54.) The notion of productivity 
is fundamental to competitive enterprise, but businessmen’s decisions concerning it are usually couched 
in the language of accounting—in terms like “unit labor cost,” “unit cost,” and “profit.” Furthermore, 
the Soviet press, aptly described by Professor Inkeles of Harvard as a “mass trade journal,” reflects by 
design the state’s preoccupation with problems of production and productivity. 

‘ On the place of productivity in Soviet theory, see J. H. Towster, Political Power in the USSR: 
1917-1947, New York, 1948, pp. 410-11, or A. Vyshinsky, The Law of the Soviet State, New York, 1948, 
p. 60. The term “communism” has, since Lenin, been used to describe a “higher” form of “socialism” 
characterized by distribution of output according to need and the “withering away” of the coercive 
state. 

5 Quoted by J. Stalin, Selected Writings, New York, 1942, p. 135. 
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tional training. Application of the principle of the “universal obligation 
to work” has meant the employment of many types commonly found 
outside the labor force in our own country. 

Coercive measures, institutional arrangements, and propaganda 
direct the energies of the “workers and peasants”—the new “collective 
masters” of society—toward the prime task of plan fulfillment. The 
captive trade unions encourage their members to improve skills and 
increase productivity. Western-style unemployment was “abolished” 
in 1930. Absenteeism, turnover, and labor mobility are restricted by 
stern penalties. The regular hours content of the work year has been 
steadily increased since the early 1930’s. The so-called “shortest work 
day in the world” is officially 8 hours long, and overtime is obligatory. 
The 6-day work week became standard on the eve of World War II. 
The relatively small annual output of consumer goods is distributed in- 
sofar as practicable on an incentive-pay basis. Honors, prizes, and 
privileges are bestowed upon exemplary workers. Even before the 
advent of Stakhanovism in 1€35, the techniques of “socialist competi- 
tion” were stressed for advancing backward workers, for generating 
“labor enthusiasm,” and for hastening maturation of the socially 
oriented “new Soviet man.” 

What has been the impact on labor productivity of these and other 
features of life under Soviet planning—on the course of productivity 
and the standing of USSR in comparison with advanced “capitalist” 
countries? To questions such as these the rest of this paper is devoted. 


II 


It must be recognized at the outset that certain practical and theo- 
retical difficulties preclude the derivation of altogether satisfactory 
quantitative answers to these questions. Among the practical obstacles 
are the limited quantity, uneven quality, and inadequate documenta- 
tion of the Soviet data and indexes relating to production, labor in- 
put, and productivity.® Virtually full-coverage statistics are preferred, 
but these are compiled only as an incident to other operations. The 
personnel involved are generally overburdened, insufficiently trained, 
and inadequately equipped with mechanical computing instruments. 





* Much has been written in recent years about the peculiarities of Soviet statistics. See, for example, 
articles by A. Bergson, C. Clark, M. Dobb, A. Gerschenkron, N. Jasny, 8. A. Rice, and others in 
Review of Economics and Statistics, November 1947, February 1948, February 1950, and February 1952; 
N. Jasny, “International Organizations and Soviet Statistics,” Journal of the American Statistical 
Association, March 1950, pp. 48-64; S. A. Rice, “Statistics in the Soviet Union,” Bulletin of the Atomic 
Scientists, June 1952, pp. 159-62; M. Mead (and Associates), Soviet Attitudes Toward Authority, New 
York, 1951, pp. 45, 120-21; and the summary of a 1950 “Methodology Conference of Central Statistical 
Administration USSR,” American Statistician, June-July 1952, pp. 16-22. O. Morgenstern, On the 
Accuracy of Economic Observations, Princeton, 1950, p. 13, refers toa “report”—most probably apoc- 
ryphal—that “lie coefficients” were applied by the central statistical authority to data submitted to it 
during the early 1930's. 
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The pressure to fulfill plan commitments, moreover, encourages es- 
tablishments to distort and exaggerate their accomplishments. Au- 
thorities decry such abuses, but policing of the reporting system remains 
lax. The finished statistics are used for administrative purposes and, 
after proper selection and landscaping, for domestic and foreign propa- 
ganda, too. 

Of particular interest are the limitations of productivity statistics 
for the important “industrial” sector and its components, which are 
called “branches.” The available figures, especially those for recent 
years, refer mainly to ovtput per worker. Since variant, inconsistent, 
and ambiguous figures have been published for some years, the official 
industrial productivity series cannot be stated definitively. Since the 
suppression of detailed data on output, employment, and hours of 
work began in the mid-1930’s, foreign students must generally confine 
any independent reconstruction of the Soviet record to the pre-war 
years. Although Soviet analysts made international productivity com- 
parisons in the late 1930’s, no similar estimates have been released 
for the postwar years. In fact, the prewar comparisons are still cited in 
Soviet literature, and 1929 is still erroneously mentioned as the Ameri- 
can peak year. 

The formula used for constructing the Soviet index of industrial 
output per worker has most probably varied, yet the index is commonly 
supposed to be a quotient of the composite “gross” production and 
“wage-earner” employment measures.’ Test computations made by 
this writer for 1928-35 suggest that some sort of weighted average of the 
productivity relatives for the major branches was used for at least this 
interval.* A labor-weighted average of branch productivity indexes-—— 
the kind of formula sponsored in our own country by technicians of the 
WPA National Research Project and the U. S. Bureau of Labor 
Statistics—was officially computed, together with the quotient of the 
composite production and labor measures, for about five years fol- 
lowing May 1943.° It is not known which of the alternatives was pub- 
lished for which years. 





7 A position somewhat different from the one taken here is maintained by W. Galenson, “Russian 
Labor Productivity Statistics,” Industrial and Labor Relations Review, July 1951, pp. 497-98, and 
“Trends in Soviet Industrial Productivity” (Rand Corporation, P-276), March 6, 1952, pp. 1-4. 

The “wage earner” figures refer to rabochiye. They may exclude apprentices (ucheniki), whose con- 
tribution to output is, however, included in the productivity numerator. 

® See, also, E. L. Granovskii and B. L. Markus, Ekonomika Sotsialisticheskoi Promyshlennosti 
(Economics of Socialist Industry), 1940, pp. 475-79. 

® Slovar Spravochnik comments on the concurrent computation of the labor-weighted average in 
the 1944 edition, pp. 218-19, but omits reference to it in the 1948 edition, p. 398. This formula was 
preferred by leading Soviet students but apparently gave disappointing results in the reconversion 
period. D. V. Savinskii, Kurs Promyshlennoi Statistiki (Course in Industrial Statistics), 1949, p. 208, 
suddenly announced its loss of official status after several pages of favorable comment—as though the 
information was received near press time. 
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The gross output (valovaya produktsiya) measures which underlie 
the productivity indexes for industry and its components have been 
roundly criticized inside and outside USSR." Various features which 
clearly lead to an unduly favorable picture of Soviet growth have been 
well publicized—like the inclusion of defective and incomplete goods 
and the incorporation of new products and new models (especially 
before 1937) with weights reflecting the price inflation which occurred 
after 1926-27, the “fixed” base year. But other features which have 
received less attention also inflate the apparent volume of output and, 
at least for some years, also exaggerate the changes in production and 
productivity with respect to the base levels. For example, the output 
aggregates include the cost of materials and other elements of “gross” 
price. The extent of duplication depends, not only on the degree of 
integration and on the reporting practices of establishments, but also 
on the significance of (net) imports of raw and semi-fabricated materials 
consumed in industry. Even if the measure of output were neutral to 
changes in degree of integration, domestic output during the war was 
doubtless overstated (and the American contribution to it under- 
stated) by the inclusion of substantial Lend-Lease shipments of steel 
and other materials. Similarly, the gains recorded in output and pro- 
ductivity after the war reflected in part the inclusion of materials 
acquired through trade agreements and as booty, reparations, and 
shares in the production of joint extraterritorial corporations. 

Another source of inflation of output and productivity is the very 
definition of gross output. The Soviet concept is generally discussed as 
though it referred only to the “physical” volume of finished goods and 
semi-manufactures, but it also includes an adjustment for inventory 
changes and numerous expenditure items presumably reckoned in 
“fixed” rubles. Among the latter are: communal and other incidental 
services, internal plant additions, maintenance and repairs, product 
development cost, contract work, and certain cancelled orders."' The 
opportunities for confusing input with output are obviously abundant 
in a newly industrializing country which frowns upon markets and 
establishes prices administratively. Soviet writers themselves have 
often criticized tendencies to count all work done by “socialist” labor 
as “productive”; to improve the apparent performance of an establish- 





10 See publications by Gerschenkron and Jasny cited in footnote 6. Among the authoritative out- 
spoken Soviet critics before the war were Sh. Ya Turetskii and A. I. Rotshtein. 

11 See A. I. Rotshtein, Problemy Promyshlennoi Statistiki SSSR (Problems of Soviet Industrial 
Statistics), I (1936), pp. 149-67, or Savinsxii, op. cit., pp. 93-100, or Bukhalterskiit Uchet: Sbornik 
Vazhneishikh Rukvodyashikh Materialov (Bookkeeping Accounts: Compilation of Most Important Guid- 
ing Materials), 1948, pp. 300-06, or Slovar Spravochnik, 1948, pp. 112-16. 
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ment by padding the “other business” account; to record defective or 
unwanted goods as output; and to include illegally in industrial pro- 
duction any major construction projects executed with plant person- 
nel." Trotsky, who anticipated many of the criticisms made by Ameri- 
can students of Soviet statistics, noted particularly the inflation of out- 
put aggregates by the inclusion of repair, so that apparent output 
increases as quality diminishes: “It is not always certain what hides 
behind [the ruble of output]—the construction of a machine or its 
premature breakdown.” 

In 1949, current weights were supposed to replace the largely fic- 
titious 1926-27 prices; and other changes had been made earlier. 
But the industrial output and productivity indexes remain subject to 
idiosyncrasies of the gross product concept and will presumably re- 
tain much of the buoyancy so desirable from the standpoint of propa- 
ganda. 

It would be a mistake, however, to suppose that Soviet planners and 
analysts have been misled. In 1936, industrial production statistics 
based on another concept (tovarnaya produktsiya) were introduced, 
but not made available publicly. These figures have apparently been 
used in planning since 1940. The concept is restricted to goods in 
“finished” or “tradable” form, and the prices are current. 

The Soviet gross output definition and measurement practices 
doubtless lead to industrial production and productivity indexes which 
rise more spectacularly than conceivable alternative indexes computed 
by the most preferred Western methods. Conversely, the application 
of the Soviet concept and methods to American data should lead to 
indexes rising more steeply than the standard U.S. series. Western 
practice would favor the “finished” product concept in lieu of “gross”; 
a chain index for branch output (with or without adjustment of the 
links for deficiencies of coverage) in lieu of a fixed-base index with 





12 For some recent evidence of dissatisfaction with accounting practices, see A. Arakelian, Industrial 
Management in the USSR, Washington, 1950, especially p. 142; M. Yampolsky, “The Planning of 
Labour,” Soviet Studies, July 1952, pp. 78-94; and New York Times, August 4, 1952. 

4 L. Trotsky, The Revolution Betrayed, New York, 1937, pp. 12-15. 

4 J, A. Sholomovich, Analiz Khozyaistvennoit Deyatelnosti Promyshlennogo Predpriatiya (Analysis 
of Economic Activity of the Industrial Enterprise), 1949, p. 29. According to a recent announcement 
(New York Times, March 13, 1952), production and productivity calculations will be made on the basis 
of 1952 (in some cases, 1951) prices, and new items will be introduced at comparable 1952 prices. ‘This 
procedure is consistent with the 1949 change from 1926-27 weights, and hence does not appear to be a 
new departure. 

8 See, for example, P. Kholodnyi, “Planirovaniye Tovarnoi Produktsii” (Planning of Goods Pro- 
duction), Planovoye Khozyaistvo (Planned Economy), 1940, No. 4, pp. 48-52; Savinskii, op. cit., pp. 
112-13; and Bukhalterskii Uchet, pp. 308-12. Gerschenkron’s assumption that the “gross” and “finished” 
production concepts are close seems untenable in view of the abuses permitted by the former concept and 
complaints of Soviet technicians leading to the introduction of the latter. See his A Dollar Index of 
Soviet Machinery Output, 1927-28 to 1987 (Rand Corporation, R-197), April 6, 1951, pp. 6-7. 
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pseudo-price weights assigned to new products and models; and, in the 
event a fixed base is desired, a recent weighting pattern in lieu of a very 
early one. For industry as a whole, Western practice would favor the 
combination of branch output indexes with net (i.e., value-added or 
employment) weights, but this preference may not critically affect the 
results. On the other hand, the characteristic Western treatment of 
nonhomogeneous output like machinery—either exclusion from the 
industry total or incorporation by means of a coverage adjustment— 
could well iead to understatement of the true rates of Soviet industrial 
output and productivity growth. 

Finally, some remarks on the theoretical limitations of the index- 
number technique are in order. Even if complete, ideal data were 
available, it would still be impossible to construct unique or universally 
acceptable temporal indexes for USSR or any other industrializing 
country undergoing fundamental structural change within a brief 
period. Index-number statisticians in “mature” Western countries are 
probably not so concerned as they ought to be over this important 
fact. 

What applies to temporal indexes also applies to “spatial” ones— 
to international comparisons of multiproduct industry branches or of 
industry as a whole. Unique or universally acceptable measures cannot 
be derived for countries which differ significantly in technology, tastes, 


price patterns, and assortments and specifications of products. Diffi- 
cult practical problems of valuation also arise when the countries do 
not engage in substantial trade with each other or with a common 
third country. In these circumstances, there is a temptation to over- 
state the significance of comparisons restricted to industry branches 
with pseudo-homogeneous products, like “steel” or “coal.” 


III 


When the most favorable Soviet claims concerning industrial output 
per worker are pieced together, a picture of remarkable growth emerges." 
The 1950 figure was supposed to be 37 per cent above the 1940 level, 
and the 1951 figure a modest 10 per cent above that for 1950. The 1950 
figure was also supposed to be almost 5 times the 1928 index, almost 


18 The series discussed here is based on Sh. Ya Turetskii, Vnutri-Promyshlennoye Nakopleniye v 
SSSR (Intra-Industrial Accumulation in USSR), 1948, p. 30, for 1900, 1940; N. S. Maslova, Proizvo- 
ditelnost Truda v Promyshlennosti SSSR (Labor Productivity in Soviet Industry), 1949, pp. 27, 42-43, 
for 1921-39; certain Pravda issues for postwar annual link relatives and for the relation of 1950 to 1940; 
and some other sources for other dates. 

The figures for the earlier years refer to “large-scale” industry, to establishments meeting the mini- 
mum employment or power criterion set up for the 1913 census. With the development of Soviet in- 
dustry, the criterion has diminished in importance and appears finally to have been abandoned 
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7 times that for 1913, and almost 10 times that for 1900. Setbacks are 
implied for 1940-41 and for the early reconversion period; but none 
is now shown for any year during the first plan period despite con- 
temporary evidence. The series available for individual branches for 
the years prior to World War II generally indicate above-average 
gains for producers’ goods and below-average gains for consumers’ 
goods. 

Available statistics for output per man-hour show gains of a similar 
order, except that the rise during the first plan period (1928-32) is 
sharper and the rise in the third plan period (1938-40) not so steep.’” 
If man-hour productivity figures were also reported for the postwar 
period, they would doubtless show a smaller advance than 37 per 
cent beyond the 1940 level because the hours content of the work year 
increased. 

Two variant estimates of output per worker derivable for 1950 are 
somewhat smaller than the figure cited above but are, nevertheless, of 
the same order. One, based on a lower 1940 level plus the 37 per cent 
gain for 1940-50, is 4.5 times the 1928 index.'* The other, based on 
reported gains of 41 percent for the first plan period, 82 percent for 
the second, and 32 per cent for the third (plus the 37 per cent gain for 
1940-50), is 4.6 times the 1928 index." 

The average annual increases claimed for Soviet worker productivity 
are, of course, much greater than the 2 per cent long-term rate com- 
monly cited for our own country. Furthermore, no decisive slackening 
of the rate is evident through time. During the half century 1900-50, 
the claimed average annual rate was 4.7 per cent. For the period since 
the eve of World War I, 1913-50, the average rate was 5.2 per cent. 
For the period since the inauguration of planning, 1928-50, the average 
was 7.5 per cent. For the first three plans, 1928-40, the remarkable 
average of more than 11 per cent is indicated. The rate for the fourth 
plan, 1946-50, is even higher. Though reattainment of the 1940 level 
was not claimed until 1948, an index 37 per cent above that level was 
supposed to have been reached by 1950! The directive for the present 





17 Based principally on SSSR i Kapitalisticheskiye Strany (USSR and Capitalist Countries), 1939, 
p. 75, and B. L. Markus, “The Stakhanov Movement and the Increased Productivity of Labour in the 
USSR,” International Labour Review, July 1936, p. 7. 

18 The lower 1940 level (3.25 times the 1928 index) is established on the basis of Maslova’s 1937 
figure and the claim of a 32 per cent increase to 1940. Our higher figure is based on Turetskii’s possibly 
incorrect application of this percentage change to the 1938 figure and his (consistent) assertion that the 
1940 index was “more than 3.5 times” the 1928 base (we used 3.6). : 

19 These percentage gains for the first three plans, which vary somewhat from other published 
figures, are still shown in the 1952 article cited in footnote 12. 
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five-year plan, 1951-55, sets a target of “approximately 50 per cent” 
for the entire period,”° or about 9 per cent per year. 

‘ In view of the remarks made in the preceding section, these figures 
overstate the degree of Soviet progress. Starting from a more primitive 
productivity level, USSR should have exceeded the average American 
long-term growth rate—especially since the welfare implicit in our high 
levels of civil liberties, leisure, and consumer goods output does not 
enter into the measure of production. On the other hand, measure- 
ment according to best Western practices would probably have indi- 
cated no important gain in Soviet output per worker for the first plan 
period as a whole; a substantial gain during the second and third 
periods, say to about 75 per cent above the 1928 level by 1940; and 
reattainment of the 1940 level by the end of 1950 rather than a signifi- 
cant rise beyond it. 

The claims of substantial growth in output per worker during the 
first and fourth plans—8 per cent per year and about 13, respectively 
—are unconvincing even though the productivity potential was cer- 
tainly raised during those years. The rise in the potential during the 
first plan was partly realized in the subsequent years, as the new and 
renovated plants were utilized more fully and as labor acquired the 
new “habit of work” and gained experience. In both periods, the influx 
of workers into the nonagricultural labor force exceeded the planned 
increase by millions. Such large additions of inexperienced workers 
would seem incompatible with sharp advances in marginal productiv- 
ity if output were measured by Western methods. The first plan period, 
moreover, was very unsettled; it was marked by terror in the agricul- 
tural sector, elimination of unemployment compensation, evisceration of 
trade unions, inefficient management, widespread “wrecking,” and ac- 
knowledged deterioration of output quality. Output per worker should 
also have been unfavorably affected by the alleged reduction in official 
hours of work and in the number of work days per year. Obstacles to 
true productivity growth in the fourth plan period included poor utili- 
zation of the swollen labor force, the reconversion decline in compul- 
sory overtime, and the demoralization incident to the continuation of 
austerity and the loss of wartime “savings” through currency reform. 
The official claim that 1950 production exceeded the 1940 performance 
by 73 per cent doubtless reflected the inclusion of inflationary items 
of the kind already mentioned— e.g., material imports, the rehabilita- 
tion of war-ravaged plants and (illegally recorded) major construction 
by industry personnel, and the payment of wages to workers retained 


20 New York Times, August 23, 1952. 
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during reconversion for retraining. Measured productivity might also 
have been favorably affected by asymmetrical treatment of the output 
and the employment of war prisoners and penalized Soviet citizens en- 
gaged in industry. 

Among the productivity estimates devised by foreign students for 
Soviet industry, those of Colin Clark for 1913-36 are probably the best 
known.”! These series differ conceptually from the official productivity 
index; they purport to show net output (in constant American dollars 
of 1925-34) per employed person (wage earners and others) and per 
corresponding man-hour. Between the terminal dates, output per em- 
ployed person supposedly advanced only one-third, while output per 
man-hour increased by three-fourths. Between 1928 and 1936, the rise 
in both measures was only about one-fifth. Clark takes no direct ac- 
count of the new products introduced after 1928; he relies on a small 
list of products which could reflect the change in the total only by co- 
incidence. Curiously, he estimates a gain of more than two-fifths in 
both series for 1935-36, the first year of Stakhanovism—a gain about 
twice that claimed by the official Soviet index of output per worker. 

Clark has also prepared the longest series available on Russian real 
national product per man-hour.” This series refers to the Western con- 
cept of national product rather than to the Soviet concept, which is 
much less inclusive. It is expressed in American dollars of the period 
1925-34 and covers selected years in the interval 1900-47. The figures 
necessarily show little benefit from the shift of labor from agriculture 
to industry, since the denominator omits “disguised unemployment” 
and women in agriculture. Only in 1940 did man-hour productivity 
exceed the 1913 level—and then, barely. A net gain of only one-eighth 
is indicated for 1928-40. A serious decline is shown for the first plan 
period. The 1940 level had not been reattained by 1947. 

Jasny’s figures for agriculture, which presumably do not exclude 
“disguised unemployment,” suggest a modest rise of 28 per cent in 
net output per farm worker between 1928 and 1940.* The rise, if any, 
in net output per man-hour was smaller; the increase in the number of 
work days was probably not offset by the reduction in hours per work 





2 Review of Economic Progress, September 1949, p. 1, or Conditions of Ei ic Progress, London 
1951, p. 277. See latter source, pp. 185-87, 254-55, for details of the net product index, especially the 
series which was used in interpolation for early years and extrapolation for 1929-36. 

% Review of Economic Progress, April 1949, p. 2. The treatment of agricultural employment is dis- 
cussed in Conditions of Economic Progress, p. 190; and the productivity series is gomnented up to 1940, 
together with various underlying series, on p. 191. 

% See N. Jasny, The Socialized Agriculture of the USSR: Plans and Performance, Stanford, 1949, 
pp. 420, 676, 714; J. A. Kershaw’s review of this book, American Economic Review, March 1950, especi- 
ally pp. 183-84; and Kershaw, “Soviet Agricultural Prospects” (Rand Corporation, P-278), March 7, 
1952, pp. 10-12. 
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day. Since the 1932 level of farm productivity was doubtless much 
lower than that for 1928, the advance beyond the later year was much 
more pronounced. 

Finally, some increase in worker productivity during the planning 
era is also indicated for rail transportation.“ Freight and passenger 
traffic per “direct” employee in 1932 was 87 per cent —"ove the 1913 
average. The increase between 1928 and 1932 must have been of a simi- 
lar order. A further substantial gain during the second plan period 
brought the index to 2.7 times the 1913 figure by 1937. The sizable 
gains during these years reflect the increase in capacity utilization un- 
der the same kinds of pressure which elevated American railroad pro- 
ductivity at a still more impressive rate during 1939-43. Soviet rail 
productivity has been virtually static, however, since 1937, except for 
the wartime setback. The 1950 goal calied for an increase of only 8 
per cent above the 1937 or 1940 level, but a rise of only 2 per cent was 
achieved. In view of the increase in the work week, the 1950 level of rail 
output per man-hour was doubtless below the 1940 rate. 


IV 


Despite the defects of the available statistics and the numerous pit- 
falls of international comparison, there can be no doubt that USSR 
lags far behind U. S. in productivity. In the final section, it will be sug- 
gested further that the gap cannot easily be closed even though the dif- 
ferential might be narrowed. 

Soviet researches conducted shortly before World War II concluded 
that industrial output per worker in 1937, calculated in 1926-27 rubles, 
amounted to two-fifths the American figure. A slightly higher ratio, 44 
per cent, was claimed for output per man-hour. The ratios for output 
per worker and per man-hour were both supposed to be about one- 
fourth for 1932; and about one-sixth and one-fifth, respectively, for 
1928. USSR had supposedly caught up with Great Britain and Ger- 
many in output per worker by 1937, although rates only half as large 
were claimed for 1928. 

These figures might be challenged on various grounds (e.g., the plaus- 





% The railway productivity series was compiled from Soviet sources by J. H. Blackman, of Johns 
Hopkins University and Rand. 

% E. Vasilyev and Kh. Kovalzon, “Za Dostoinyu Sotsialisticheskogo Obshchestva Proizvoditelnosti 
Truda” (For Productivity of Labor Worthy of a Socialist Society), Planovoye Khozyaistvo (Planned 
Economy), 1939, No. 3, p. 154; and 8S. Kheinman, “Ob Ishlishkakh Rabochei Sily i o Proisvoditelnosti 
Truda” (Concerning Excess Labor Force and Labor Productivity), Problemy Ekonomiki (Problems of 
Economics), 1940, Nos. 11-12, p. 106. The claim of parity with Britain by 1937 is an exaggeration if we 
accept the British-American ratio of almost one-half computed by L. Rostas, Comparative Productivity 
in British and American Industry, Cambridge (Eng.), 1948, p. 27. 
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ibility of the dollar-ruble conversion rates). But, even if they are con- 
ceded, the gains they show are not remarkable. Comparisons made in 
1926-27 rubles should show Soviet accomplishments in a more favora- 
ble light than comparisons expressed in dollars or sterling.” Further- 
more, the years 1932 and 1937 were depressed ones for U. S. and West- 
ern Europe. Finally, a different perspective is obtained if a 1908 esti- 
mate by Lenin, cited in other contexts by Soviet authors, is juxtaposed 
to the figures for the planning era. According to Lenin, the productiv- 
ity of the Russian industrial worker was already 30 per cent of the 
American in 1908.” Since this percentage is higher than the 1928 and 
1932 proportions, the alleged gain between 1928 and 1937 largely rep- 
resents the recovery of lost ground. An advance from 30 per cent in 
1908 to only 40 per cent in 1937, a span of 30 years, would seem like 
slow progress indeed. 

Other computations made outside USSR also indicate a large Ameri- 
can-Soviet industrial productivity differential. According to Colin 
Clark, Soviet net industrial output per man-hour (all personnel) was 
less than one-fourth, and net output per employee was about one-fifth, 
the corresponding American rates in 1936. These relationships, accord- 
ing to Clark, were less favorable than in 1913. Another set of esti- 
mates,?* used in American government circles during the war, placed 
Soviet industrial output per man-hour in 1935-38 at 36 per cent of the 
American rate—a relative performance comparable to the British, 
slightly inferior to the German, but superior to the Japanese. Accord- 
ing to the same source, Soviet productivity in munitions-making was 
39 per cent of the American rate in 1944—almost equal to the British 
percentage, below the German, but substantially above the Japanese. 

“here also are some postwar estimates, the foundations of which are 
even less firm. Included among these is a low figure of one-fourth or 
one-fifth the American industrial output per man-hour,”® a median fig- 
ure of 40 per cent the American industrial output per worker,®® and a 
broad range of one-fourth to three-fourths the American output per 
nonagricultural worker.*! 





% See computations of 8. Yugenburg in Planovoye Khozyaistvo, 1937, No. 3, pp. 52-54. 

27 Figure cited by B. L. Markus in a volume issued by the Soviet Academy of Sciences, Proizvo- 
ditelnost Truda v Promyshlennosti SSSR (Labor Productivity in Soviet Industry), 1940, p. 31. 

% War Production Board figures quoted by R. W. Goldsmith, “The Power of Victory: Munitions 
Output in World War II,” Military Affairs, Spring 1946, p. 79. 

2D. B. Shimkin, “What is Russia's Industrial Strength? II,” Automotive Industries, August 10, 
1950, p. 35, and “Russia’s Industrial Expansion,” Fortune, May 1951, p. 107. 

%° Galenson, P-276, p. 20. Based on an examination of 1937-39 Soviet-American differentials in 
individual industry branches having output changes measurable in “physical” units; and on probable 
changes in Soviet and American productivity since then. 

J. A. Kershaw, “The Economic War Potential of the USSR,” American Economic Review, May 
1951, p. 479. 
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In agriculture, too, U. 8. enjoys a decisive productivity advantage. 
Before the war, according to Jasny,® over 4 times as many man-days 
were required per acre of grain in USSR as in U. S., almost 4 times as 
many per acre of cotton and potatoes, and about 6 times as many per 
acre of sugar beets. Over 6 times as much labor was expended for the 
same quantity of milk. Volin notes that the gamut of operations on 
winter grain required 2.5-3.6 man-days per acre in 1937 in the highly 
mechanized South Ukraine and North Caucasus, while the average 
direct American labor requirement for wheat was one man-day or less.* 
This differential in favor of U. S. would be increased if management 
and other overhead labor were included in the calculations. The per- 
sistence of a sizable American advantage in the postwar period is indi- 
cated by the sharp advance in American agricultural productivity since 
1939 and by continuing Soviet complaints against such irregularities 
as the employment of superfluous labor and the maldistribution of per- 
sonnel on collective farms. 

When statistics for the whole of productive activity are examined, 
USSR again eppears to be hopelessly outclassed. Clark’s series for net 
national product per man-hour, as we have already noted, omits “dis- 
guised unemployment” in agriculture, so it tends to favor USSR. 
Nevertheless, his figures indicate that Soviet productivity amounted 
to one-third the American in 1900, one-fifth in 1928, less than one-fifth 
in 1940, and less than one-eighth in 1947. Both before and after World 
War II, according to Clark, British and French productivity also 
greatly exceeded Russian levels.** When United Nations figures for 1949 
national income in American prices® are related to estimates of labor 
force, we find that Soviet income per labor force participant was only 
one-sixth the American—about $600 compared to $3500. The Soviet 
figure was only one-third the British and two-thirds the French. 

It may be superfluous to add that Soviet per capita output and con- 
sumption standards also lag far behind the American and, in many in- 
gtances, Western European.* USSR clearly has a long way to go to 
fulfill the “cardinal task,” as Stalin has designated it, of outstripping 
vhe leading “capitalist” nations in these terms. Soviet literature now 





%2 Jasny, The Socialized Agriculture of the USSR, p. 443. 

3% L. Volin, A Survey of Soviet Russian Agriculture, U. 8. Department of Agriculture, August 1951, 
pp. 33-34. 

%4 Review of Economic Progress, April 1949, p. 2. 

% United Nations Statistical Office, National and Per Capita Incomes of 70 Countries in 1949 Ex- 
pressed in U. 8. Dollars, New York, October 1950. 

38 Statistics assembled by State Department, United Nations, and Food and Agriculture Organiza- 
tion amply illustrate the Soviet lag. Also of interest is the differential in food purchasing power of an 
hour’s pay, shown by I. B. Kravis, “Work Time Required to Buy Food, 1937-50,” Monthly Labor 
Review, February 1951, pp. 147-48. Even the new 1955 goals of Soviet output per capita are only 
fractions of the corresponding actual 1951 figures for U. 8. 
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speaks of future development as a “gradual transition” from “social- 
ism” to “communism” requiring a vague number of five-year plans.*’ 


Vv 


What about the future course of Soviet productivity and the future 
of the Soviet-American productivity differential? USSR may reasona- 
bly be expected to make gains in both respects in the years to come, 
but rapid progress seems unlikely. There are restraining factors within 
the very process of broad economic development, even (especially?) 
under “planning”; in the initial conditions of industrial backwardness 
and of surplus rural population; and in the incompatibilities of Soviet 
objectives, ideology, and institutions. 

Even under ideal circumstances of abundant entreprereurial abil- 
ity at all administrative levels, time would be required for the devel- 
opment of a high productivity potential and of efficient techniques for 
realization of this potential. Time is needed at best to expand a na- 
tion’s supply of materials, power, plant and equipment, and adequately 
skilled labor. It is also needed for the establishment and maintenance 
of a dynamic, more or less harmonious balance of these expanding fac- 
tors. In USSR, it will take a long while to achieve the timely replace- 
ment of worn-out machinery, the near-optimum utilization of equip- 
ment as it becomes available, the specialization of establishments in 
lieu of excessive integration, the smooth redistribution of industrial (or 
agricultural) personnel made redundant by mechanization and auto- 
matization,.** 

The way in which the surplus labor problem is handled will have an 
important effect on the rate of Soviet productivity growth. The theory 
of “universal obligation to work,” the fact of rapid shift of labor from 
agriculture, the absence of unemployment in the market sense, the 
virtual ban on discharge of personnel, the toleration of labor hoarding, 
and the frozen status of workers contribute to the accumulation of 
poorly used reserves in industry as mechanization proceeds. There is 
no ministry responsible for combing out and redistributing such re- 
serves across ministerial or geographic boundaries—except, in a sense, 
the police. The existence of these reserves discourages the mechaniza- 
tion of auxiliary activities like loading, unloading, internal transporta- 
tion, and inspection; it also discourages the most efficient use of mech- 





3? Advancing this “gradual transition” is listed among the duties of communists in the new Soviet 
Communist Party Statute (New York Times, August 21, 1952). In June 1950, an important though 
inconclusive conference was held on “Means of a Gradual Transition from Socialism to Communism” 
(translated report in Current Digest of the Soviet Press, February 24, 1951, pp. 2-9). 

% It is significant that the directive of the new five-year plan includes a resolution “to condemn the 
practice of economic organizations which underestimate the tasks of introducing new technology and mech- 
anization of laborand which permit the incorrect use of manpower” (New York Times, August 23, 1952). 
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anisms installed for these purposes. Of course, the situation may be 
rationalized in terms of on-the-job training or the registration, control, 
and maintenance under discipline of a manpower pool for military 
emergency. If there were such an emergency, USSR could conceivably 
perform a feat which it failed to accomplish in World War II—the 
elevation of productivity as the labor force is reduced and as output 
is maintained or even increased. If “peace” continues, however, dis- 
guised unemployment in industry will serve as a brake on productivity 
advance, and the productivity potential will be far from realized. 

Three alternatives seem open to Soviet leaders. One is to redistribute 
labor compulsorily on a broad scale. The second is to develop a vigor- 
ous household sector—recognize leisure as a respectable category, re- 
duce the labor force, and raise the levels of consumption and civil 
liberties. The third is to allow a surplus to be absorbed gradually in 
the same plant or in others under the same ministry as the productive 
economy expands. The third course seems most likely to be followed. 
The second seems least likely since premature distribution of the ad- 
vertised ultimate benefits of communism could jeopardize the proletar- 
ian dictatorship, weaken control over the population, and hamper de- 
velopment of the Soviet economy along present lines. The first course is 
impractical, even though it conforms to the popular American concep- 
tion of the present Soviet way of life. 

It is unlikely that “labor enthusiasm”—what Stalin called the “zeal 
of the millions”—will have a decisive influence on the Soviet produc- 
tivity level. There is abundant evidence that the Russian worker is not 
being remolded under heat and pressure into a “Soviet man” who is 
more productive than an American worker responding to money in- 
centives. In fact, it appears that Stakhanovism, the most significant 
of the alleged manifestations of labor enthusiasm, largely owes what 
success it has achieved to incentive wages. Communist leaders still de- 
plore the “chase after the ruble” as a “capitalist remnant” that must 
be eliminated, but they also extol the “socialist” system of piece rates 
and even extend it to prison labor. 

While USSR struggles to raise productivity, our own should continue 
to rise at a modest rate. A mechanical extrapolation—based on a growth 
of 2 per cent per year in American output per worker, 4 per cent for 
USSR, and a Soviet-American productivity ratio of two-fifths in 1950— 
would lead to a ratio of only three-fifths by 1970.** 

E Lenin’s promise of victory through superproductivity seems impos- 
sible of fulfillment. But Lenin also mentioned other routes, and USSR 
has certainly improved its world position despite inferior productivity. 


%9 Galenson, P-276, pp. 24-25, presents six alternative projections, including this one. 











)0S- 
SR 
ity. 





ON PROBABILITIES IN BRIDGE 


Dan F. Wavau,* 
The National City Bank of New York 


AND 


Frepericx V. Wavuau,* 
U. S. Bureau of Agricultural Economics 


In the game of bridge, the probability of a successful 
finesse, or a favorable “break” in a suit, depends not only upon 
the original deal, but also upon all the bids and plays which 
have been made by one’s opponents. The widely quoted a 
priori probabilities, based solely upon the deal, are not gen- 
erally correct, and are often misleading, after play has begun. 
This paper shows that the evaluation of probabilities during 
the course of play of bridge requires the use of the Bayes 
formula. Two examples are given to illustrate the proper ap- 
plication of that formula to the measurement of probabilities 
in typical situations arising ir bridge. 


ROBLEMS of probability arising in the course of play of contract 
bridge are mentioned in a highly useful and entertaining book [1] 
written by one of the foremost players in the country. He writes: 


Probability problems in the play of the cards are of two general kinds: 


1. Who holds a particular card? 
2. How are the cards of a certain suit divided? 


The answer to any such question depends on the amount of information you 
have. Before the play begins, you may make an estimate of the odds; but 
as you acquire information during the play, your estimate must be revised. 


The author does not go on to suggest a method by which the revision 
could be made. 

The writers of the present article follow Borel [2] in suggesting that 
the Bayes formula be used to evaluate the revised probabilities. Borel 
uses somewhat artificial examples not taken from actual play. We shall 
analyze two examples, both taken from actual play. 


THE BAYES FORMULA 


A general exposition of the Bayes formula may be found in Uspensky 
[3]. We shall consider its application to bridge, using a notation suited 
to that purpose. We are concerned with certain probabilities to be de- 
termined after the bidding is complete, and play begun. We shall as- 


* The authors are indebted to Albert H. Morehead, bridge analyst of the New York Times, for 
many practical suggestions. 
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sume (as in most bridge columns) that South is declarer. After play 
has started, South knows at least one, and perhaps others, of the 26 
cards originally dealt to his opponents, East and West. He also knows 
the 13 cards in his own hand, and the 13 cards held by dummy (North), 
Often he is able to place with one opponent or the other, cards other 
than those which have actually been played. If, for example, West 
chooses to lead a Queen of a suit not bid by East, South may well as- 
sume that West also holds the Jack, and probably the 10 also, of that 
suit. At some stage of play, there are n unplaced cards, r held by West 
and n—r held by East. These n cards could be partitioned between 


West and East in any one of (") different ways. Each of the possible 


partitions could be defined by naming r cards in the West hand, since 
if West held these r cards, East would necessarily hold the other n—r 
unplaced cards. In general, South will need to determine the probabil- 
ity that West’s actual holding belongs to a particular group (for exam- 
ple, the group of possible partitions in which West holds the King of 
spades, or exactly three diamonds). 

Let West’s possible holdings be classified into the following m ex- 
haustive and mutually exclusive groups: 


(1) hy, he, leas. hm; 
and let the capital letters, 
(2) M,, Hp, oe Hn, 


represent the numbers of possible partitions belonging to each of these 
groups. In general, H; can be computed by well-known methods of 
combinatory analysis. The total number of possible partitions is 


N sae 7 H ‘= (*). 
i=l r 

If these N possible partitions are considered to be equally likely, the 
probability that West’s holding belongs to hy is Hi/N. This is, how- 
ever, a special case. In general, the several possible holdings are not 
equally likely. In a sense, H;/N is the a priori probability that West’s 
holding belongs to Ax, measured after play has begun but before taking 
account of the strategy followed by East and West. But H;/N is quite 
different from the kind of a priori probabilities which are quoted in all 
bridge books and bridge columns; where the probability of any finesse 
is always 4, and the probably of a 3-2 break in any 5-card suit is 
always 0.67826. The a priori probabilities that are universally quoted 
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are based upon an assumed random distribution of 26 cards; 13 to 
West and 13 to East. The usually-quoted figures are not even accurate 
measures of a priori probabilities after play has begun. 

Let S represent the particular series of bids and plays which have 
been made by East and West, and let 


(3) (S, A), (S, A), 7 a (S, Hm) 


be the conditional probabilities that S would occur if West’s holding 
belongs to hi, he, - - + , hm, respectively. 

Then, if the hand were played N times, the expected frequency of 
S from holdings belonging to h, is Hi(S, H,); and from all holdings 
is )-.,H.(S, H;). After S has occurred, the inverse probability that 
West’s holding belongs to h, is 


H,(S, H;) 





() (Ha, 8) = — 
d HAS, H.) 


t=1 


and the inverse probability of each group of holdings is 


65) Ay(S, Hy) H2(S, He) H,(S, Hm) 


> HS, H;) . H,(S, H;) > HS, H;) 








t=1 t=] t=] 
In our analysis, we find it convenient to prepare a table, such as 
1 2 3 + 5 
A, (S, Hi) H,(S, ;) (Hi, S) 
H.(S, H2) (H2, 8) 


(S, Hm) Hn(S, Hm) (Hm, S) 





D AS, Hi) 
t=1 
The first column defines each of the m groups of partitions. The fre- 
quencies in the second column are computed. The conditional prob- 
abilities in the third column must be estimated on the basis of South’s 
observation of the habits of his particular opponents or of the normal 
strategies of bridge players generally. The fourth column is computed 
from the data in columns 1 and 2, and the sum shown below. The fifth 
column exhibits the inverse probabilities defined in (4) and (5), and is 





82 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1933 


obtained by dividing each entry in the fourth column by )_7.,H.(S, H)). 

Example 1. One of the leading bridge players in this country is 
Mrs. Helen Sobel. In her book published a few years ago [4], she de- 
scribes, without giving the bidding, a hand which she played at a 
contract of six spades. Her opponents’ cards were: 


spades Q9864 
hearts Q10987642 
diamonds J8765 
clubs AJ987632 


West’s opening lead was the 4 of hearts. Mrs. Sobel played the Jack 
from dummy, and it won. She took that as placing the Queen of hearts 
with West. At trick 2, Mrs. Sobel played the Ace of trump, West and 
East following suit with the 4 and 6, respectively. East won trick 3 
with the Ace of clubs, so that in order to make her contract, Mrs. Sobel 
had to win all the remaining tricks. East led a small diamond to trick 4. 
He has no reason to credit his partner with a possible trick in trumps. 
He can see that the offense has command of hearts and clubs. Di- 
amonds will appear to him the least of several evils. South wins the 
trick in dummy, and leads the deuce of spades. East plays the 8. 
South now might play either the Jack, hoping that East holds the 
Queen, or the King, hoping that of the two spades not yet played, 
West holds the Queen and East the 9. The “percentage play,” accord- 
ing to a well-known writer, was the finesse. Mrs. Sobel, however, 
played the King; the Queen dropped; and she went on to make her 
slam. 

On what grounds is the finesse described as the “percentage play”? 
On the grounds that as of the point of completion of the deal, there were 


(33) = 10,400,600 possible partitions of the 26 East-West cards, 13 


13 
25 i ad P 
2 = 5,200,300 were partitions in which 


East was holder of the Queen of spades; and only (*) ot = 1,410, 
864 were partitions in which West was the holder of just 2 spades in- 
cluding the Queen. The oo corresponds to the fact that all West’s 
13 cards are selected in this case from the 25 cards other than the Queen 


of spades, which East and West hold between them. The (‘) (3) 


to each; and that of these ( 
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corresponds to the fact that West in this case holds with the Queen, 
one of the 4 other spades, and his remaining 11 cards are selected from 
the 21 cards other than spades, that the two partners hold. Hence the 
a priort probabilities as usually reckoned were 5,200,300/10,460,600 
or 50% that Hast was holder of the Queen of spades, and only 
1,410,864/10,400,600 or 13%, that West held the Queen doubleton.! 

That was, in Jacoby’s language, “an estimate of the odds, before the 
play begins.” It was after East’s play to trick 5 that Mrs. Sobel had 
the decision to make, whether to play the Jack of spades from her own 
hand, or to play the King. Her preliminary estimate, she is counselled 
by Jacoby, must be revised in the light of the information that she 
has acquired in the course of play thus far. At this point, she was 
able to place the East-West cards as follows: 


West East Unplaced 
spades + 8 6 Q 9 


hearts Q 4 2 109876 
diamonds 8 6 J75 


clubs 6 A J98732 


We classify West’s possible holdings into four groups, according to 
the spades held, and analyze the probabilities as indicated below: 


West’s spade 
holding H (S, H) H (8S, H) (H, 8) 
Q94 3003 1.0 3003 0.467 
Q 4 3432 ; 3432 0.533 
94 3432 - none 0.000 
4 3003 ; none 0.000 


N =12870 > Hi(S, Hx) =6435 


Mrs. Sobel explains in her book that she had frequently played with 
the holder of the West hand. It had been her experience that he habitu- 
ally made an opening lead from worthless trumps in preference to 
leading away from a suit honor. The opening trick in this hand showed 
that he had led away from the Queen of hearts in this case. The second 
trick showed that he held the 4 of the trump suit which he could have 
led instead. Mrs. Sobel concluded that he would have led away from 
the heart Queen only as an alternative to leading away from the spade 
Queen. Those are the only promising cards that East and West have 
between them, except for the Ace of clubs which East played to trick 





1 We do not say that this is the correct way to compute a priori probabilities. Doubtless one should 
consider the cards already played or placed. But the probabilities shown are those usually used in texts 
and articles on bridge. 
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3. Mrs. Sobel in effect rated the East-West plays thus far as sub- 
stantially certain for any partition belonging to set hi or he, and as 
practically certain not to have been made from any belonging to set 
hs or hy. On those assumptions, the finesse is practically certain to fail, 
and the play of the king has a better than even chance to succeed. 
Using equation (4), the probability that West originally held the spade 
9 4, or the lone 4 is [H3(S, Hs) +H.(S, H,)]/>-%.,H(S, H;)=0. The 
probability that he originally held the Q 4 is H,(S, H2)/> 2-1 HA(S, 
H,) =0.533, or 53.3%. 

Thus the “estimate of the odds, before the play begins,” and the 
estimate revised according to information acquired during the play, 
lead to opposite conclusions: 


@ priori revised 
chance of successful finesse 50.0% Nil 
chance of dropping Queen 13.6% 53.3% 


Example 1 illustrates a case in which one of the alternative lines of 
play is certain to fail. Declarer obviously should try the other alterna- 
tive, however low the probability of success. Thus, the probability of 
53.3% that declarer can succeed in dropping the Queen is of only aca- 
demic interest. It is shown to indicate that the correct probability 
differs substantially from the a priori probability, which bridge experts 
use to find the “percentage play.” More important is the conclusion 
that West, rather than East, is certain to hold the Queen of spades. 
This conclusion can be reached only by an analysis of inverse prob- 
ability; whether a formal analysis, such as that above, or an informal 
analysis, such as Mrs. Sobel made. 

Example 2. Another leading bridge player, Mr. George Rapee, de- 
scribes in a newspaper article [5] a match played in the Eastern States 
contract bridge championships in March 1952. 

The bidding was: 


South West North East 


2 no trump pass 3 diamonds pass 
3 no trump pass 5 no trump pass 
6 no trump pass pass pass 


After 8 tricks of play, South had been able to place the East-West 
cards as follows: 


West Unplaced 


spades Q7652 

hearts J 10 95 
diamonds J982 

clubs 3 
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The 6 unplaced cards must be distributed 1 to West and 5 to East. 
Declarer has lost one trick, so must take the remaining five tricks to 
make his contract. The lead at this point lies in dummy. South can 
make his contract by leading hearts if West’s one unplaced card is the 
Queen of clubs. Or he can make it by leading clubs if West’s unplaced 
card is one of the two hearts. West’s original heart and club holdings 
may be classified as follows: 


Wests’ holdings 





mB 
g 


H (S, H) (H, 8) 
Hearts Clubs 
J 109 
J 105 


3 0.048 
3 
J 10 Q 
9 
7 
6 


0.000 
0.238 
0.238 
0.238 
0.238 


J 10 
J 10 
J 10 


=e Se © © 


coooon 
— = = = CO © 
cocoon 


c= 
i) 


Success with the club lead depends only on finding the Queen of clubs 
with East. As of the point of completion of the deal, the probability of 
that was 0.5. 

Success with the heart lead requires (1) that East hold the 9 of 
hearts, (2) that West hold the Queen of clubs, and (3) that West hold 
exactly one of the other East-West clubs. As of the point of completion 
of the deal, the probability of that was approximately 0.008. 

If one follows the usual folklore of bridge, relying solely upon a 
priort probabilities, he would have to agree with Mr. Rapee that the 
heart lead was “the mathematically inferior play” in this case. But this 
judgment completely disregards the bidding and 8 tricks of play. And 
the developments during play have greatly changed the probabilities 
in question. 

We now show how the probabilities should be appraised at the 
beginning of trick 9. 

The heart lead will succeed if West’s original holding were h;; the 
club lead will succeed if West’s original holding were either h; or he. 
There has been nothing in the East-West play thus far, it appears to us, 
and we assume that South had the same impression, that has any rele- 
vance to the East-West heart-club holdings, except West’s play of the 
10 J of hearts to tricks 2 and 7. South considered it “unlikely,” the 
news item states, that his particular opponent in this case would have 
“false-carded” by playing 10 and then Jack from a holding of J 10 9. 
To make it concrete, we assume that West would have done so no more 





86 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1953 


than 1 time in every 5. We take it as certain that he would not have 
played 10 J, retaining the 5, from the holding J 10 5. 

On these assumptions, as the table shows, the probability that the 
East-West plays have been made from the holding hs is 0.238. The 
probability that they have been made from either h; or hz is 0.048, 
On that reckoning, the heart lead is not “the mathematically inferior 
play.” It has, it is true, less than a 3 chance of succeeding, but the 
club lead has less than 1/20. 

The probabilities of success as ordinarily computed a priori as of 
completion of the deal, and as computed by the method here outlined, 
as of the point of South’s decision, compare as follows: 

@ priori revised 
Heart lead 0.008 0.238 
Club leau 0.500 0.048 


CONCLUDING REMARKS 


Many additional examples could be given. Books’and newspaper 
columns analyze hundreds of problems of probability arising during the 
play of bridge. We have never noted a case in which the analysis has 
been theoretically correct, and find that the answers given are often 
misleading.’ Theoretically any problem of probability arising during 
the play of bridge could be measured correctly by the Bayes formula, 
and only by the Bayes formula. 

The application of the Bayes formula to typical problems arising in 
bridge may seem too complicated to have practical value in actual play. 
We note, however: (1) that an understanding of the principles of in- 
verse probability will frequently help a player make a quick and rough 
judgment as to which alternative is superior, and (2) that one can 
work out once and for all the probabilities associated with certain 
typical situations which commonly arise. Concerning the second point, 
no player would stop the game to compute the a priori probability that 
6 unplaced clubs would be split 3-3, but he can remember that it is 
less than 3. 

The main difficulty with the analysis is that of making realistic 
estimates of the conditional probabilities (S, H;). But a good bridge 
player can form some shrewd judgments concerning these conditional 
probabilities by observing the habits of his opponents, and of bridge 
players generally. Thus East not having a bid against a no trump 





2 We mean no disrespect to the bridge experts. Many of them are brilliant men, and most of them 
know more about bridge than we do. We object only to the plain fact that the “probabilities” which 
they universally use do not, in general, measure the probability of success with a given line of play. 
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contract, West’s opening lead very commonly is the fourth highest 
card in his best suit; against other contracts he may avoid leading away 
from a King, if he leads a Queen in an unbid suit he probably holds 
the Q J 10, and so on. Unless one takes account of these habits, con- 
ventions, or strategies, he can not estimate which alternative play is 
superior. The commonly-quoted a prior: probabilities usually are not 
very helpful, and often are downright misleading. 
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PROBABILITIES OF CERTAIN SOLITAIRE 
CARD GAMES 


Rosert E. GREENWOOD 
University of Texas 


I. INTRODUCTION 


INCE the days of the Extra Sensory Perception (ESP) experiments, 

mathematicians and statisticians have been especially interested 
in “matching” card problems. In this note, two types of solitaire card 
games involving matches will be considered in which the ordinary 52 
card deck is used. For convenience the two card games will be called 
games A and B. (It will be recalled that the ESP experiments used 5 
sequences of 5 cards each, or a 25 card deck. A tabulation of matching 
probabilities for such a 25 card deck is given by Greville [4]). 

In game A with 52 cards, a player plays out the cards one at a time 
against another deck (or against a standard order) and scores a success 
if he can go through the deck without getting a match in suit and se- 
quence. In game B the suit feature is disregarded and the deck is con- 
sidered as a multiple set of four sequences of 13 cards each. The player 
scores a success if he can go through this multiple deck without getting 
a match (sequence match only). 

Game A is a special case of the classical “probléme des rencontres,” 
and has been dealt with by many authors. See Huff [5] and the ap- 
pended editorial note which gives additional references, or Feller [2], 
pages 62-63 and 66-69. The probability of obtaining exactly k matches 
is given approximately by 


(1) P, = 1/e-kl, k = 0,1,2,---,6,7. 


Game B has been considered by Battin [1] and Kaplansky [6]. 
Special generating functions to give the probabilities of the various 
numbers of matches or hits have been developed. Kaplansky [6, page 
911], notes that asymptotic approximations are available for large decks, 
but states that such approximations are too crude for game B with 
52 cards. For a fourfold deck with a large number of cards (n>>52), 
the probability of no matches is approximately 1/e'. 

Joseph A. Greenwood [3] investigated the variance of certain card- 
matching processes, and from a general formula given by him the 
variances of the number of matches in games A and B are found to be 


(2) 2 = ) op? = 64/17. 
88 
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The means of the number of matches are well-known, 
(3) ba = 1; ba = 4. 

II. PROBABILITY OF A WIN FOR GAME B 


To calculate the probability Po for a win (i.e. no matches) for game 
B, one may proceed as follows. Let S be defined by the relation 


(4) S=2+ +--+: + Xs, 


where 2; is associated with the ith card in some convenient numbering 
of the 13 cards in a sequence. Then a draw or a play from (S—z;,) 
could be interpreted as signifying that the 7th card was not played but 
that the other twelve possible cards all had equal chances of being 


played. 
The product 
13 
(5) I] (s - 2,4 


t=] 


could be interpreted as signifying that for four specified plays no cards 
marked 1 were played, for four other specified plays no cards marked 2 
were played, etc. Nothing in the above signifies that four cards of 
each category must be played in the 52 plays. However, if one were 
select the coefficient of (2:22 - - - 2:3) in the expansion of form (5), 
one does get such an assurance. Indeed, this coefficient is the number 
of ways in which a 52 card deck (considered as a multiple deck of four 
13 card sequences) can be played out so as to give no matches against 
a standard order. The division by the total number of ways of playing 
out such a deck, 52!/(4!)", will give the desired probability Po. Hence 
the coefficient of the term (2172 - - - 213) in the expansion of the form 


K (21, 22, +++ , 213) 
(4!)18 13 


(6) 52 U (S— a! 


0° “i 
- 62 (> 2) +5"(D 622+ > 162.0) meee ] 


«52! inj 








is the probability Po. The numerical coefficients in the above expansion 
are obtainable in terms of multinomial and binomial coefficients. 

If one identifies A; as the absolute value of the coefficient of the 
term involving S*-* in expansion (6), then 


(7) Py = Ap — Ar + As —° ++ + Ase. 
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In addition to the probability for no matches (called Po» above) the 
probabilities for 1, 2, 3,---+, matches are of some interest. Rather 
than develop these explicitly, it is convenient to refer to a procedural 
method of Kaplansky, who formulated an algebraic symbolism and 
applied combinatorial methods thereto in order to obtain an analogue 
to Poincaré’s formula in probability. The adaptation of Kaplansky’s 
result [6] to the problem at hand gives 


k+1 k+2 
Pr = Ar —( k ) Aaa + ( k ) Ane =o 


52 
+ ( ) Au 
k 


This is just Poincare’s formula with a different interpretation of the 
symbolism. 

The numerical values {A;} may be computed by direct but tiresome 
methods. One gets 


Ao = 1; A; = 4; 


4 [ 50! ()(") 
> Laneatle/\1 
50! 4\ /4\ /18\7 (4) 134 
. anna \a (a (3) 52! «17 


A; = 13004/1275. 


(8) 





(9) 





The denominators tend to get larger; to avoid changing denominators, 
a table of values of CA; where C was chosen so as to give integral values 
to CA; is given in Table 1. A decimal approximation to A; is also 


given. The value of C was chosen to be 
ws C = 520) = 52-51-50--- 38 
= 5,860,187,598,276,457,205,760,000 


(although the least common denominator required was not quite that 


large). 
The estimated values for Ais, - - - , Aig were obtained by computing 
ratios of successive A; values, 7=0, 1, ---, 15, constructing a differ- 


ence table, and extrapolating. 
Using truncated forms of relations (7) and (8) one may now estimate 
the values P,, k=0, 1, 2, - - - , 7 as given below in Table 2. Using a rule 
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TABLE 1 
é CAg Decimal approximation to A; 
0 5 8601 8759 8276 4572 0576 0000 1 
1 23 4407 5039 3105 8288 2304 0000 4 
2 46 1920 6695 1120 3097 3952 0000 7.8823 5294 
3 59 7693 1727 6852 5878 4604 1600 10.1992 1569 
4 57 1075 3987 0269 2981 1210 2400 9.7450 0200 
5 42 9610 1417 7847 3070 1987 8400 7.3309 9640 
6 26 4959 3716 6818 1148 6146 5600 4.5213 4624 i 
7 13 7742 2222 3641 6652 5747 2000 2.3504 7462 
8 6 1590 4477 5471 4063 5549 6960 1.0509 9789 
9 2 4053 2409 1387 4685 5882 7520 -4104 5172 
10 8303 3647 3254 8076 8716 8000 -141U 9111 
ll 2558 1494 8800 1913 1236 3520 -0436 5303 1 
12 708 9146 7824 0756 1191 4240 -0120 9713 3 
13 177 8606 3073 7964 4982 8864 -0030 3506 72 
14 40 6209 3763 9221 2378 4192 -4006 9316 787 
15 8 4841 4962 3331 6627 2512 -0001 4477 608 


-0000 2776 (estimated) 
-0000 0490 (estimated) 
-0000 0080 (estimated) 
-0000 0012 (estimated) 





of thumb method, it is possible to get an estimate of the size of the 


error. Note that A;>A.4: for +=3, 4,---+, 14. The series (8) is an 
alternating series; arithmetical calculations suggest that for each k 
in the range above there is an integer m such that after the term involv- 
ing Ax ym the series is an alternating decreasing series. One may then 
estimate the size of the error as being less than the size of the next 


(rejected) term. Thus 


20 
(11) | Estimated error in P, | Ss (|, ) 40 k=0,1,2,---, 7. 


TABLE 2 








Estimated proba- Estimated Error 


Gram-Charlier 





bilities for game B probabilities 
Po .016233 + .00000002 -016161 
a | -068899 — .0000004 -068953 
P, . 14416 + .000004 - 144370 
Py .19821 — .00003 - 198240 
P, .2013 + .0001 -201113 
Ps .1613 — .0003 - 160890 
Ps -1052 + .0008 - 105728 
P; .060 — .0015 -058665 














92 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1953 


Assuming Ax» =0.0000 0002, the sizes of the estimated errors may be 
computed from relation (11). These values are also tabulated in Table 
2. It must be stressed that these estimates are only useful heuristic 
guides, i.e. no claim is made that the absolute error satisfies relation 
(11). 

As is well known, game A may be approximately described by the 
Poisson distribution, relation (1). For game B, however, the mean and 
the variance of the distribution of hits are not equal (being 4 and 64/17 
respectively). Rather than choose the Poisson distribution (for which 
the mean and the variance are equal) it may be advisable to consider 
some distribution which does not require equality of these statistics, 
for example the Gram-Charlier series of type B.! 

If f(k) is the Poisson distribution entry, then the first two non-zero 
terms of the Gram-Charlier series [7] are 


(12) g(k) = f(k) + (1/2)(Variance — mean) A?f(k). 


Values of g(k) have been calculated and are also given in Table 2. The 
rather close agreement between the Gram-Charlier series and the es- 
timated probabilities from relation (8) is certainly remarkable. 

The numerical calculations were either checked by another method 
or repeated by the same method. The results stated are believed to be 
accurate, but due to the extreme tediousness of this work no claim for 
error-free results is made. 


III. EXPERIMENTAL TEST 


In order to observe the theory in practice, 1000 games of solitaire 
were played, the number of games being fixed in advance. For each 
game, the cards were thoroughly shuffled (at least 6 interlacing shuffles 
and 2 mixed cuts using three or more stacks), and played out for all 52 
cards. Two types of scores were kept, one as if game A were being 
played and the other as if it were game B. For game A, it is possible 
to compare the distribution of hits with a theoretical distribution based 
on relations (1), and the variance of the hit distribution with (2). For 
game B a limited comparison is possible based on the partial distribu- 
tion given by Table 2, and the variance of the hit distribution as given 
by (2). The tabulation of results follows: 





1 The author is indebted to Nathan Mantel, Biometrics Section, National Institute of Health, 
for suggesting the Gram-Charlier series and noting its close approximation to the probabilities given by 
(8). 
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GAME A 
| Number of hits | 0 | 1 2 | 3 4 | 5 Joana over 
Expected games 368 368 184 61 15 3 1 
Observed games 359 384 174 66 16 1 0 
































Expected mean of hit distribution =1, observed mean =0.999. Expected variance of hit distribution =1, 
observed variance (about expected mean) =0.957. 





GAME B 
11 
Number of hits 0 1 2 3 4 5 6 7 8 9 10 | and 
over 





Expected games 
(from relation 8) 16 69 144 | 198 | 201 | 160 | 105 60 _ _— _ _ 
Observed games 14 56 | 174 | 194 | 210] 153 | 108 | 55 22 8 6 0 
Expected games 
Gram-Charlier 16 69 | 144 | 198 | 201 | 161 | 106 | 59 | 28 | 12 4 2 















































Expected mean of hit distribution =4, observed mean =3.932. Expected variance of hit distribution 
=64/17 =3.765 approximately, observed variance (about expected mean) =3.466. 


It will be noted that the deviations from the expected values are for 
the most part small and reasonable. The close agreement between the 
values computed from relation (8) and the Gram-Charlier values has 
already been commented on. 
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NON-LINEAR FUNCTIONAL RELATIONSHIP BETWEEN 
TWO VARIABLES WHEN ONE VARIABLE IS 
CONTROLLED 


R. C. Geary 
Central Statistics Office, Dublin 


Joseph Berkson has indicated that when the error of one of 
two variates is “controlled” in a sense which he defines, and 
when the true relation between the variates is linear, the classi- 
cal regression formula can validly be used for the unbiased 
estimation of the coefficients. In the present paper, Berkson’s 
results are discussed and a theory of estimation and of tests of 
significance is sketched for the case of controlled experiments 
when the inherent relation between two variables is non-linear, 
The results are applied to the study of a constructed example. 


OSEPH BERKSON [1] has made a significant contribution to regression 
J theory and practice in isolating a case in which classical regression 
theory can validly be used to express a functional relationship between 
two variables, both subject to error. Berkson’s viewpoint justifies the 
use of linear regression theory without qualification in the typical 
scientific experiment, in which conditions are partially subject to con- 
trol. This viewpoint may have been known implicitly or explicitly to 
many statisticians. The writer was not aware of it, though he has 
worked on the theory of relationship between statistical variables. That 
other students may be in the same situation may justify the present 
further consideration of Berkson’s results. 

Berkson’s main point is that in the case of the “independent vari- 
able” it is possible to distinguish between what he terms “controlled” 
and “uncontrolled” measurements. The objects of the present contribu- 
tion are (a) to clarify this distinction; (b) to consider the implication 
of the distinction in its relation to non-linear regression; and (c) in 
particular, to sketch in a sampling theory, with tests of significance, 
appropriate to controlled observations. It is shown that, while the 
R. A. Fisher [2] theories of tests of significance apply unchanged in the 
linear case, they do not apply in the case of non-linear inherent rela- 
tionships. 

It may be useful, at the outset, to explain briefly the difference be- 
tween uncontrolled and controlled measurements by reference to a 
particular example. An operator is making controlled observations when 
he is measuring out quantities z of a drug fixed in advance, say 1 gm, 
2 gms, 3 gms, etc., and measuring some reaction v. In making these 
measurements, however, he makes errors: when he thinks he is measur- 
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ing 1 gm, he has really measured 0.95 gm, 1.13 gm, etc.; when he 
thinks he has measured 2 gms, he has really measured 1.81 gm, 2.05 
gm, etc. Assuming a linear relationship between the nominal! measure- 
ment x and the actual reaction », it is clear that there are two sources of 
error in v: (i) that due solely to the error in measuring 2, and (ii) that in 
measuring »v itself, the actual recorded value being y. The operator’s 
recorded pairs of observations will, of course, be (z, y) where z is 1 
gm, 2 gms, 3 gms, etc. In this example, if the inherent relation is 
linear, i.e. if the relationship between the true values u and », of z and y 
respectively, is v=a-+-6u, then Berkson has shown that the classical 
regression formula of y on z will give statistically consistent estimates 
of the coefficients a and 8, making the usual assumption of randomness 
in regard to the errors of observation in the measurements of the two 
variables. This conclusion extends to the many-variable case as well, 
when the relation is linear. 

In the case of uncontrolled measurements, the operator is not trying to 
work to any preassigned schedule of measurements of the “inde- 
pendent” variable. He simply measures x and the reaction y and asso- 
ciates one with the other. In this case the classical regression approach 
gives biased estimates of the coefficient 6 even if the number of pairs 
of observations is indefinitely great, for reasons which are well-known: 
in non-mathematical terms the regression of “y on z” means that one 
is associating the y’s with the z’s strung out in order. In ordering the 
x’s, however, one is necessarily taking into account the errors in the 
z’s as well as the “hard core” which is the correct measurement. In 
simple terms, the x’s to the right of the scale will contain large positive 
errors and those on the left large negative errors. This is why the re- 
gression of y on z is “too flat” in the z-direction and, of course, if one 
orders according to y, too flat in the y-direction in the case of regression 
of z on y: hence the two regressions. While in Berkson’s case of con- 
trolled experiment and in the case of uncontrolled experiment, which 
occurs when neither variable can be regarded as the “independent” 
variable (case II of Table 1), both regressions cannot be valid—in 
Berkson’s case one only, and in case II neither, is valid—Frisch [3] and 
Reiersgl [7] have shown that in the case of uncontrolled experimentation 
both regression equations can be used to derive useful information 
about the true values of coefficients. In the case of two variables in 
fact, it can easily be shown that the true (functional) coefficient 6 
must lie between the two regression coefficients. Except when the 
number of pairs of observations is very large and especially when the 
variables are closely correlated (when the regression lines lie close to- 
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gether) it is usually as good a procedure as any to take a simple average 
of the two regression coefficients as the estimate of the true value since 
this estimate is likely to differ from the true value by less than the error 
standard deviation from the methods referred to for case II in column3 
of Table 1. While, therefore, the answer to Berkson’s rhetorical ques- 
tion [1]: “Are there two regressions?” is “No,” it must be added that in 
many cases valuable information can be derived from the two regression 
coefficients. 

Table 1 shows the appropriate solution applicable to the commonest 
cases when the inherent relation is linear. The actual observations are 
always x and y where 
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z=urte 
(1) y=o+f 
v =a-+t Bu, 


e and f are the respective random errors, u and »v the respective (un- 
known) true observations which are presumed to be connected by an 
exact linear relation. The statistical problem is the estimation of the 
coefficients a and 8. 














TABLE 1 
INHERENT RELATION LINEAR 
Case Conditions Solution 
I. Classical re- | e=0, f statisti- | Least squares, strictly applicable only 
gression cally independ- | when f normally distributed, mean zero 
ent of u=zand | and variance the same for all values of z, 
v when the R. A. Fisher [2] significance 
test theory for coefficients and linearity 
also applies. 

II. Both variables | e and f statisti- | (i) Wald [8]: classification of observa- 
subject to un- | cally independ- tions into two groups 
controlled error | ent of one an- | (ii) Reiersgl [6]: used third variable z 

other and of u related to z and y 

and v (iii) Geary [4]: used semi-invariants of 
two dimensions of total degree higher 
than 2—applicable only for non- 
normal distribution in (z, y) 

III. Both variables | e and f statisti- | Berkson [1]: showed that classical re- 
subject to er- | cally independ- | gression formula for y on z applied, 
ror, that of z | ent of one an- | subject to special conditions specified 
controlled other and of z | above for I. Fisher significance tests 

and v apply. 

















1953 


rage 
ince 
rror 
nn 3 
ues- 
tin 
sion 


nest 
are 


un- 
r an 
the 


only 
Zero 
of 7, 
ance 
rity 


rva- 
le z 


8 of 
sher 
10n- 


re- 
ied, 
fied 
ests 





NON-LINEAR FUNCTIONAL RELATIONSHIP BETWEEN TWO VARIABLES 97 


It will be useful for what follows to reproduce what is, in effect, 
Berkson’s algebra. From (1), 


(2) y=a+ B(x —e) +f. 


Let the number of z-grades be m. To simplify the algebra I shall as- 
sume that, since the scale of z, the presumed observation, is under the 
control of the experimenter, he is working to such a scale that 


> tet = , k=1,2,---, 
t=] 
and further that the “experiment” consists in making m pairs of actual 
observations (2z;, y:). There may be many replications of the experi- 
ment. Then, denoting by E the average of an indefinitely large number 
of replications, and from (2), assuming that in each z-grade He and Ef 
are both zero, 


> z,*Ey; =a p a x°* = mapr, k=0,1,2,3,---, 


t=1 i 


(3) 
} x;*—'Ky; = B > x;7* = mBuee, k=1,2,---, 


where ox is the 2kth moment of the z, the odd moments being now 
zero. The simplest solution in (a, 8) is 


ma = >, Ey: 


4 
“) mByu:, = > rEy;. 


Consistent estimates a and b of a and 6 respectively are given by 


Mapa = > x*¥;, k=0, 1, 2,3,---, 


t=] 


mbpor od > a 27-19, k= 1, 2; OH 


+ 


(5) 


where #; is the mean of the observations yin the 7th z-grade; 9;=y; if 
there are no replications. The classical regression estimates will be 
obtained fron (5) by taking the lowest values of k. It may be worth 
observing that, while all values of k will yield consistent estimates of the 
coefficients, we have no hesitation in favouring the classical regression, 
or simplest solutions, on grounds of efficency, when the variance of 9; 
is the same in each of the z-grades, for then the regression solutions 
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are the maximum likelihood solutions (when e and f are normally dis- 
tributed) and so have minimum variance. 

As regards tests of significance and the establishment of confidence 
limits of the estimates a and }, it is obvious from (2) that the R. A. 
Fisher [2] theory, with the usual assumptions, applies literally. The 
error in the ordinate is, however, now (f—e) instead of f as in the case 
of the Fisher theory. The basic assumptions are satisfied, however, 
namely that the error variance can be assumed to be the same and the 
errors normally distributed for the different values of x. Since f and e 
are statistically independent, the (unknown) error variance will be 
o7'+6’c2 and the significance test will be less sensitive and the range 
of the confidence limits wider than in the Fisher case. In this regard 
the considerable effect of the coefficient 8 will be noted. 


THE NON-LINEAR CASE 


The sampling theory appropriate to the non-linear case of controlled 
experiments is, however, fundamentally different from the Fisher 
theory which applies when the error e is zero and, as Fisher has shown, 
the appropriate tests of significance are basically identical with those 
used in the linear case. It will suffice to discuss the estimation of co- 
efficients for the case of the polynomial of the third degree. The alge- 
bra is extremely simple and the extension to higher degree poly- 
nomials will present no special difficulty. 

As before, the presumed measure of the abscissa is z but the real 
measure is u, the error being e, so that = u-+e, and the actual measure 
of the ordinate is y with error f, so that y=v+/f. The exact relation 
between u and v is 


(6) v=at But yu? + bu’. 


The problem is toestimate the coefficients a, 8, y, and 6 and to discuss 
their sampling errors. We assume that Fe, Ee*, and Ef are all zero for 
each x; that the unknown variance ¢,= Ee? is the same for all the m 
values of z; and that 


1 m 
m= = > s* 
™ jm 


@) 0 = ae z,*—!, 


In the linear case a sampling theory can be derived from a single ob- 
servation y for each x. According to the method developed in the 
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present paper replications are required in the non-linear case.’ We 
assume, for simplicity, that the number of observations in each of the 
maz-grades is the same, i.e. that the whole experiment is repeated n 
times. From (6) 


(8) ys = a + B(ty — &:) + (zi — &:)? + 5(2; — €:)* + hi, 


{= 1,2,---,m. 


Multiplying (8) across by 2,°, 2, z?, and 2? in succession, averaging 
in each z-grade and summing for all grades, we find 


vo = & + Yue 

vy = nue + Ops 
(9) 

vor = fue + Ya 

vsi = nha + Sys 
where 


1 
va = > Ci LY: 


m i 
(10) 
E=at ve 
n= B+ 36e, 


2 being the variance of the error e. 
From (9) we can readily find y and 6 as follows: 


1 
(us — we”) = va — wevo = = )» (z,? — we) Ey; 
(11) 


1 
(weve — w4”)d = porn — Mar = Pr pm (ues? — pyr.) Ey. 


It will be noted that, while ~ and 7 can also be found from (9), a 
and 6 cannot, without the intrusion of the nuisance parameter é2. 
Actually the estimates of y and 6 are equivalent to the least squares 
estimates: those of a and £ are not, except in the trivial case of ¢.=0. 
We are usually interested only in the formula for 6 in order to establish 





1 Theoretically it is possible to conceive of a model whereby, with a single y-observation in each 
grade, maximum likelihood estimates of the error frequency parameters, together with the estimates of 
the coefficients could be derived. Assume, e.g., that the independent errors e and f are normally dis- 
tributed with means sero and that the error distributions are the same in each z-grade, and that the 
number of grades (m) is not less than all the parameters to be estimated. However, even in the simplest 
non-linear case, i.e. when the coefficient 4 in (6) is sero, the five maximum likelihood equations (i.e. those 
for the estimation of o¢, 7, a, 8, 7) are extremely complicated and impracticable of application. 
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that it is not significantly different from zero. If the estimate of 3, 
according to the sampling theory developed below, is not significantly 
different from zero, then the universal variance of y in the ith x-grade 
will be from (8), 


(12) (8 + 272i)€ + yea + oe 


where « is the fourth moment of the error e and @» is the variance of 
error f. (m—1) estimates of ¢2 can accordingly be found by equating 
the differences of the estimated variances of the sample values of y’s 
in consecutive xz-grades to the differences derived from (12), using, of 
course, the estimates b and c of 8 and 7, as follows: 


ex; { (b + 2cx;1)? — (b+ 2cx,)?} = var ¥i-1 — Var ¥;; 
(13) #=2,3,...,m 


In practice the (m—1) estimates eo; of €2 will, of course, differ: the 
simplest procedure may be to take the simple average of the (m—1) 
estimates as the definitive estimate e of €2. It is, of course, evident that 
this procedure for the estimation of ¢2 is designed only to furnish a 
consistent estimate. It is not equivalent to the maximum likelihood 
solution and is probably not the most efficient. With 5=0, the es- 
timates a, b, c of a, 8 and y are then immediately obtained from (9). 


SAMPLING ASPECTS FOR THE NON-LINEAR CASE 


It may be remarked at the outset that the approach in this paper 
to tests of significance in the non-linear case is essentially approxima- 
tive. It consists simply in making estimates of the variances and using 
these estimates as tests: this test becomes exact only when the number 
of replications is indefinably large. As is well-known, the Fisher test in 
the non-linear case is exact, under certain conditions, even when there 
is only a single series of y-observations, i.e. one replication of the ex- 
periment. 

From (11), the estimates c and d of y and 6 are found by substituting 
sample means 4; for Ey; and the sampling variances are given by 


1 
(14) (us — wo”)? var ¢ = —— }) (2,2 — me)? var yi 
mn | 
P 1 
(15) (ueu2 — ma*)? var d = —— Dd (wer? — wari)? var yi, 
mn j; 


where n is the number of replications of the whole experiment. If 4 is 
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not significantly different from zero, from (9) 


1 
(16) be” var b = yon 9 >> 22 var yi. 
mn ¢ 


As to the estimate a of a, its variance, given €2, is, from (9) and (10), 
(17) vara = var ¥ — 2(u2 + €) covar (9, c) + (ue + e)* var ec, 
with 


(18) covar (9, c) = >. (2.7 — me) var yi/m?n(us — pr”). 


% 
STUDY OF A CONSTRUCTED EXAMPLE 


To see how the foregoing procedures would work in practice, an arti- 
ficial example has been constructed on the following lines. Let the 
inherent relation be 


(19) v=3+ 2u — u? 


and let there be 5 (=m) values of z, namely —6, —3, 0, +3, +6. 
There were 10 (=n) replications of the whole experiment. The c>- 
served values of xz, deemed to be these five values, are subject to errors 
which are assumed to be unbiased. The ordinates are also subject to 
random error. Actually the “induced” errors were normally distributed 
with mean zero and variance unity and were in fact those (1/10) 
given by E. S. Pearson [5, p. 349] for the normal population, the 50 
values in the first column of Pearson’s table being regarded as the 
errors e in x and the 50 values in the second column being those (f) 
for y. Thus the actual values in the case of the second observation 
(ie. c= —3) of the first of ten experiments were not the true values 
z= —3 and »=34+2X —3-—-(—3)?= —12 but u= —3+1.0= —2.0 and 
y=3+2X —2.0—(2.0)?—0.4= —5.40. The 50 actual values of y as 
“observed” are given in the table, with computed means and variances. 
The theoretical values, which would, of course, be unknown in actual 
experimentation, are also given as well as two theoretical series of 
values of the means, one biased due to the error term —(e)? the 
mean of which is unity, and the other with this term eliminated. The 
sample average and variance for x= —6 goes far astray because of the 
abnormally high entry at experiment X due in turn to the fact that in 
this case the z-error was 3.0. One would probably decide to eliminate 
or to adjust this value by reference to the other nine values in the 
column: though this is always a hazardous procedure if bias is to be 
avoided. If this discrepant value were eliminated, the resulting (sample) 
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mean would be 44.22 and the variance 177.65. For the following com- 
putations the original values have been retained. 

The first fact to note is the great variability of the variance from 
grade to grade, which would make the assumption of equal variance, 
necessary for the application of the Fisher sampling theory, quite 
unreal. The value of d, the estimate of 6, the coefficient of u’, is, using 
(11), 0.0127. As the standard deviation, estimated from (15), is 0.0292, 
we infer that 6 is not significantly different from zero. The estimates 
of y and 6+ their standard deviations estimated from (14) and (16) 
are 


ec = — 1.03 + 0.12 
b= 2.44 + 0.49 


(20) 


so that the sample values do not differ significantly from the true values 
—1 and +2 respectively. 























TABLE 2 
CONSTRUCTED EXAMPLE: INHERENT RELATION QUADRATIC 
Observed values of y for z= 
Experiment 
—6 -—3 0 +3 +6 
I —52.25 —5.40 4.86 | —4.24 | —28.14 
II —57.91 | —34.44 1.76 3.36 | —14.24 
III —39.15 | —15.56 5.51 1.10 —9.36 
IV —42.59 —8.55 4.16 | —5.74 | —10.09 
V —14.79 —5.59 3.50 1.96 | —21.20 
VI —35.89 —9.04 4.01 1.26 | —20.70 
VII —40.16 —4.34 | —2.80 1.11 —8.26 
VIII —59.40 | —17.19 2.70 | —0.94 | —33.59 
Ix —55.84 | —17.79 3.36 4.29 | —29.29 
x —95.50 | —14.74 5.40 | —4.79 | —17.39 
Mean 
Experiments (biased) —49.35 | —13.27 3.25 | —0.26 | —19.23 
Theoretical* 
Biased — 46 -—13 2 -1 —22 
Unbiased —45 —12 3 0 —21 
Variance 
Experiments (9 d.f.) 440.49 81.25 5.88 12.40 80.27 
Theoretical* 199 67 z 19 103 























* Unknown in the case of actual experimentation. 
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As to the estimate of a, it is first necessary to estimate the variance 
¢. of the error of observation e of z. Using (13), with the sample 
values for b and c, the four estimates of €2 (i.e. @22, €23, €24, aNd é25) are 
respectively 2.48, 1.10, 0.81, and 0.80, the mean of which is 1.3, which 
is taken as the estimate of ¢2: the actual value, as we know, is 1. Using 


(9) and (10) 
Mean y = a + c(ee + ps) 
or —15.77 = a — 1.03(1.3 + 18) 
giving a=4.11, the actual value being 3. The variance estimate, com- 
puted from (17), using sample values, is 1.91, so that the standard devi- 


ation is 1.4. The error of estimate of a, namely 1.11, is accordingly less 
than the estimated standard deviation. 


(21) 
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COMBINATION OF NEIGHBORING CELLS 
IN CONTINGENCY TABLES 


C. C. Crate 
University of Michigan 

In the application of the x*-test for independence to con- 
tingency tables in case the expected frequencies in some of the 
cases are small, a commonly recommended procedure is to 
coalesc2 two or more rows and/or columns. Based on Cramér’s 
modified x? minimum method, this paper develops means of 
combining individual cells instead of whole rows or columns. 
Specific results are given for a pair of cells in the same row (or 
column), a block of cells, two pairs which form a 2 X2 block, 
and two row pairs whose columns do not overlap. Results 
generally are simple if the sets of cells combined form a block 
or blocks which do not overlap. The complications that arise in 
the case of partial overlapping are illustrated for two row pairs 
which have a single column in common. The estimates of the 
probability of an item falling in each of the three columns on 
the null hypothesis are obtained from a quadratic equation 
which, however, always leads to a unique set of consistent esti- 
mates. 


NE of the perennial difficulties encountered by persons who try to 

follow the directions given in the numerous books on statistical 
methods in their own research is how to proceed with contingency ta- 
bles larger than 2X2 in which the expected frequencies in some of the 
cells are below the deadline of 5 or 10 (depending on the book). Fre- 
quently a worker is very reluctant to make the frequently recom- 
mended sacrifice of coalescing two or more whole rows or columns. It 
is beside the purpose of this note to summarize or even comment on 
the various investigations and discussions of the problem of small cell 
frequencies. But it may be useful to mention that the exact test avail- 
able for the 2X2 table in such cases can be extended to larger tables if 
one is willing to carry out the necessary computations, as has recently 
been shown by Freeman and Halton [1]. Also recent papers by Adler 
[2] and Cochran [3] will be helpful in understanding the present status 
of knowledge and opinion on this issue. 

Not long ago my attention was called to a young woman who re- 
solved the difficulty for herself by the very simple procedure of combin- 
ing a single cell with a too small expected frequency with an adjacent 
cell, treating the combination as a single cell and then computing a 
x? in the obvious mechanical way, reducing the degrees of freedom by 
one. It is no longer clear that the x? obtained in this way has (asymp- 
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totically) a x?-distribution with the degrees of freedom found in this 
way. This set me to thinking about the possibility of a correct pro- 
cedure of this kind developed from an acceptable mathematical model. 

The results which follow were derived by what Cramér has called 
the modified x? minimum method. For full details and in particular 
for a proof that a x? obtained in this way has asymptotically a x? dis- 
tribution, the reader is referred to Cramér’s book [4]. Briefly, for an 
rXs contingency table, under the null hypothesis of no association or 
of independence, the probability p;; that a randomly chosen individual 
belongs simultaneously in the ith row and jth column of the table 
(i=1, 2,---,7r;j=1, 2,---+, 8) is the product of the probabilities 
p;. and p.; that an individual falls into the ith row and the jth column 
respectively. Since 


D pie = DL pi = 1, 

i=] j=l 
there are r-+s—2 unknown parameters to be estimated from the data 
before the expected cell frequencies can be found from which a x? can 
be computed. As Cramér remarks the modified x? minimum method is 
equivalent to the use of the principle of maximum likelihood for the 
estimation of the p;. and p.;. If the observed frequency in the cell at 
the intersection of the 7th row and the jth column is »;;, then the prob- 
ability of occurrence of the observed cell frequencies on the null hy- 
pothesis is 


(1) C(pi.p-1)""(pi-p-2)"2 + + + (Dr-D.)’* 
= Cp1."- Per Dr.”*-D.y”-t eels p.a* 


in which 


e r 
vin = Dini, eG = DMs, 
tl 


jm 


Tr a 
> %. = Yvjgan, 


t=] j=1 


and C depends on the »;;’s but not on the p;.’s nor on the p.,’s. In this 
case the method of maximum likelihood consists of determining the 
values #;. and #.; of p;. and p.; respectively for which the expression 
(1) is maximized for fixed values of the marginal totals, »;. and »v.;, and 
then using those values as the sought estimates of the p;. and p.;. The 
equations which have the ;. and #.; as their solutions are 
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o(=-=)-=-=-o, i? eee eer | 


(2) j=m1 \ Di- Pr- Di- Dr 
Vij Vie V.g - - 
o(2-=)-2-2 =0, (=1,---,8-1) 
t—1 \D-5 Dus Pi = =D-s 
which give the usual estimates 
Vie Vii 
fo as jue — 
n n 


To apply this method to the case in which a pair of neighboring cells 
have been combined, we assume that s>2 and that the cells combined 
are the first two in the first row. That is, ».-++»:2.=w is known but not 
vy, and v2 separately. The null hypothesis to be tested becomes: 
E(v;;) =np;.p.;fori=1, --+,r,J=3, ---,s, and E(w) =np,.(p..+p.2). 
Now the modified x? minimum or maximum likelihood equations cor- 
responding to (2) become for rows 

















Vie Ve. 
ana dain = 0, ((=1,---,r—1) 
Pi- Pr. 
as before, while for columns they are 
t==2 ai *s a 0, 
Dat p-2 P.1 P-s 
»» Vi2 
WwW im? Vig 
—-— =0, 
(3) Dit pe P-2 D-s 
Vij Vig 
—_—— = 0, (j = 3,°-+,8— 1). 
P.i P-s 
The row system solves as before to give 
Vy. P 
hi. =—> (¢=1,--+-,7r). 
n 


To solve the column system we first multiply the jth equation by 
p.;,j=1, - ++, 8, and then add, getting 


Vis 


D-s 





r= ? 
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so that 
Vj 


fig es (j = 3,°--, 8). 
n 


Finally, the first two equations of the column system are solved sepa- 
rately, giving 
= (v.1 + v.2)(v.1 — v1) 








pr and 
(4) n(v.1 4+ v.22 w) 
F - (v.14 + v.2)(v.2 = V2) . 
" n(v.i t+ ve — w) 


With these estimates of the p;. and the p.;, the expected cell fre- 
quencies and x? are calculated in the usual way. The resulting x? has 
(r—1)(s—1)—1 degrees of freedom. The reader may have noted that 
the expression for #.; is the product of the proportion of the total fre- 
quency falling into the first two columns by the proportion of items 
in the first column to the total in the first two columns after the two 
cells that were combined have been deleted. It is obvious that the 
method here developed for the first two cells in the first row applies 
equally well to any two cells lying in the same row or in the same col- 
umn; if it seems simpler, one may first interchange rows and columns 
in the table until the cells to be coalesced lie in the position for which 
the formulas have been given. 

As an example, consider the following 4X3 table 




















Totals 
18 29 70 115 232 
17 28 30 41 116 
21 11 20 52 
113 111 176 | 400 




















constructed by combining the first two cells in the third row in an ex- 
ample used in Hoel’s textbook [5] for which both expected frequencies 
are less than 10 on the null hypothesis of independence. We have 

232 116 52 


fi. = 400° eo = ———) ps. = ———% 


400 400 
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and 
3 111 ; 176 
* 400° ‘400° 
as usual. But now, 
113 35 113 57 
a = —-—— = 0.1075, p.2 = ——-———_ = 0.1750". 
400 113 — 21 400 113 — 21 
Then 


E(vn) = 400f1.p.1 = 24.9, 
etc., for the single cells while 
E(w) = 40043.(f.1 + f.2) = 14.7. 


Then from the 11 cells, x2=17.9 which for 3.2—1=5 degrees of free- 
dom lies between the 1 per cent and 0.1 per cent points. 

This method is readily extended to the case in which the coalesced 
cells form a rectangular block. For example, suppose that only the to- 
tal w=vy+2+721-+72 of the 2X2 square in the upper left corner of 
the rXs table (r=3, s23) is given. One now has maximum likelihood 
equations of the form (2) for both rows and columns but the solution 
proceeds as before, giving, 


hi. 


(v4. oe V2.) (V1. =i — V12) 





’ 











n(r1. + vm. — w) 
= (vy. + v2.) (ve. — vor — v2) 
bs = tntan-o 
A (v.y 4+ v.2)(¥.1 — Yi — v2) 
(5) aie n(v.1 + v.2 — w) 
” (v.1 + v.2)(v.2 — vie — v22) 
oi n(v.. + v.2 — w) 
b=, (§ = 3,--+,7) 
Vij , 
nae (j = 3,---, 8). 


Generally if the first r; rows (r1<r) and the first s; columns (s;<s) 
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are combined with a total «f », we have 


s 


us | 
Do 4 Dev 
ton} 


jay +1 





p= SS, (= 1,---,n) 
n Zz. >» Vij 
t=] jmsy+1 


r a 
ie Vij > Vj 


- t=ry+1 i=l 





(6) Pit rs a , (j =1,---, 8) 

nd, do Mi 
t=<r +1 j=1 
Vy. 

pi. = —> @G@=n+1,---,7) 
n 
V.j ° 

pi=—) (j= 8+1,---,8). 


The resulting x? has (r—1)(s—1)—risi:+1 degrees of freedom. 

There is still no difficulty if two pairs of cells, each pair belonging to 
the same row and both pairs belonging to the same two columns, are 
combined. Suppose the totals v=vy+712 and w=vo1-+y2 are known as 
well as the separate entries in the remaining cells. We now get 


(v.1 + v.2)(v.1 — v1 — P21) 











es nv. + ve —v — w) 
(v.4 + v.2)(v.2 -— in = V22) 

Pas n(v.a tye —v — w) 

(7) WV. 2 

v.j . 

p.i4=—) (j = 3,-++, 8) 
n 
Vy. 

pi. = —> (¢=1,---,7r). 
n 


The resulting x? has (r—1)(s—1) —2 degrees of freedom. 

Likewise things work out simply if in each of two different rows 
a pair of neighboring cells are combined, the two pairs having no col- 
umn in common. Let v=n+712 and w=123+724. We find 
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Vy. 














ier“ (¢@=1,---,7r) 
bas (yy + v2) (va — vn) 
n(v.. + v.2 — v) 
tn (v.a + v.2)(v.2 — v2) ; 
8) n(v.. + v.2 — v) 
bax (v.g + v.4)(v.3 — es) 
n(v.3 + v.4 — w) 
ae (v.3 + v.4)(v.4 — Vea) 
n(v.3 + v4 — w) 
eit Gabe 


There is only a slight modification which by now the reader can 
make for himself in case the two coalesced pairs of adjacent blocks lie 
in the same row. If v=y,-+»2 and w=»13+714, we have 














bi G@=1,--+,7) 
ise (v.a + v.2)(v.1 — vn) 
n(v.. + v.2 — »v) 
— (v.1 + v.2)(¥.2 — v2) 
(9) n(v.y + vg — v) 
hei (v.g + v.4)(v.3 — v1) 
n(v.3 + v.4 — w) 
boo (v.3 + vs4)(¥.4 — via) 
n(v.g + v.4 — w) 
bya, (Gj = 5,-++, 8). 


The degrees of freedom for x? in both the last two cases is (r—1) 
(s—1)—2. 

There is no difficulty so long as the pairs or rectangular blocks of 
neighboring cells have all of their rows or columns in common or else 
have no rows or columns in common. As soon as there is partial over- 
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lapping of rows or columns, the likelihood equations are not so simply 
solved. Consider the simplest case of this kind in which v=7y+ 712 and 
w=ve2+v23. The likelihood (or modified x?-minimum) equations for 
rows are as simple as ever, giving 


























Vy. . 
pi. = —> (¢=1,2,---,r). 
n 
But the equations for columns are 
v 4 Vip Vy - Vig , 
Pat p-2 Dp. D-s 
v z Ww V.2 — Vio — Vo Vig 
+ + = ’ 
PitDp2 Pr2tDp-s p-2 Des 
WwW V.3 — Veg Vig 
= ? 
(10) D.2+ p-s D-3 D-s 
V.4 Vig 
Pe P 
p-4 Ps 
P-o- —Bg 
P -(e—1) D-s 


On multiplying by corresponding p.;’s and adding, we still get 





Vj : 
ps —> (j = 4,---,8). 
n 
To solve the remaining three equations set 
a“ =a te = b, c=1l1—a-}); 





Dit pat ps Pit pat ps 
vitve tvs =m; and 
V.1— Mn = Aj, V.g — Vig — Veo = As, V.3 — Veg = As. 
Then on eliminating a from the first and third equations by means of 
A,(1 — c) 


m(l—c)—v 





(11) a= 


we have the quadratic for c: 
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n(n - <p A))c? - [(m “= v)(m - w) = m(Ay neni As) 
_ A,As|c + (nm =— = A))A3 = 0, 


(12) 


It can be shown by some fairly messy but elementary algebra that the 
two roots of this equation are both real and are positive proper frac- 
tions. Further the larger root cannot be used since then, using (11), we 
get a+c>1. Thus, taking the smaller root of (12), we get a unique set 
of values for a, b, and c which uniquely determine p.1, p.2, and p.3. The 
degrees of freedom for x? are still (r—1)(s—1) —2. 


REFERENCES 


{1] Freeman, G. H., and Halton, J. H., “Note on an exact treatment of contin- 
gency, goodness of fit and other problems of significance,” Biometrika, 38 
(1951), 141-9. 

[2] Adler, Franz, “Yates’ correction and the statisticians,” Journal of the Ameri- 
can Statistical Association, 46 (1951), 490-501. 

[3] Cochran, W. G., “The x? test of goodness of fit,” Annals of Mathematical Sta- 
tistics, 23 (1952), 315-45. 

[4] Cramér, Harald, Mathematical Methods of Statistics. Princeton: Princeton 
University Press, 1946. Sections 30.3 and 30.5, pp. 424-34 and 441-45. 

[5] Hoel, P. G., Introduction to Mathematical Statistics. New York: John Wiley & 
Sons, 1947, p. 192. 





APPROXIMATING THE MODE FROM WEIGHTED 
SAMPLE VALUES 


Howarp L. JonEs 
Illinois Bell Telephone Company 


The weighted mean of ordered sample observations can be 
used to approximate the mode under favorable conditions, 
where the weights are determined from the first two terms in a 
Taylor expansion of the maximum likelihood estimate. Such 
weights are shown in Table 1 for the case where the sample is 
se.ected from a t-distribution with known kurtosis. 


INTRODUCTION 


NE of the oldest and most difficult statistical problems is that of 

deciding how much weight to assign to extreme sample observa- 
tions in trying to estimate the typical or average measurement for 
some population. It may be regarded as a part of the more general 
problem of how to make estimates on the basis of observed sample 
values after such values have been arranged in order of size. 

Sample values thus arranged are now commonly called order statis- 
tics; and estimates computed therefrom, as well as other functions of 
these statistics, have been termed systematic statistics [4]. Weighted 
sums of sample values arranged in order of size may therefore be called 
linear systematic statistics [3]. Examples are the sample median, where 
a weight of zero is assigned to each order statistic except for one or two 
central values (depending on whether the sample size is odd or even), 
and the midrange where the first and last order statistic is each weighted 
one-half and every other order statistic is weighted zero. The sample 
mean may also be regarded as a systematic statistic where each order 
statistic is assigned a weight equal to the reciprocal of the sample size; 
although in this case, the equality of the weights makes it immaterial 
whether the sample values are ordered or not. 

The advantages of linear systematic statistics have been discussed 
by various writers. It is known [1, p. 483] that the sample mean is the 
best unbiased regular estimate of the mean of a normal distribution. It 
also tends to have desirable asymptotic properties in large samples 
from most types of distributions that statisticians have studied. But 
even where other types of functions are more efficient in the sense that 
for a given sample size they lead to unbiased estimates with a smaller 
variance, a linear systematic statistic may have important advantages 
in practical problems. Chief among these are the simplicity of the com- 
putations, and the economy of time and expense in situations where 
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samples of any desired size can be selected and measured with relative 
ease [2]. These advantages explain why the range (a systematic sta- 
tistic in which the extreme order statistics are weighted —1 and +1), 
instead of the root-mean-square, is generally used to estimate the 
standard deviation of a distribution in quality control applications. 
Some work has been done on problems of measuring and maximizing 
the efficiency of various linear statistics, particularly for samples from 
a normal distribution [5]. 

A deterrent to the use of these statistics is the difficulty of finding 
satisfactory weights. The general problem was discussed several years 
ago in an unpublished paper, “The Weighted Mean of Random Ob- 
servations Arranged in Order of Size,” which was written jointly with 
John H. Smith and recently reproduced in Ditto form as National 
Bureau of Standards Working Paper SEL-51-1, February 1951. The 
weights which lead to the best unbiased linear estimates of the mean, 
mode, median, or any other location parameter can usually be deter- 
mined when the first and second moments and the product moments 
of all the order statistics are known. Since these moments are difficult 
to evaluate for many typical distributions, a general method was given 
for finding weights that would lead to a first approximation of the mode 
under certain favorable conditions. The present article is the result of 
Churchill Eisenhart’s desire to have this method published more 
widely, since it has been found to be useful in broader applications. 

In order to illustrate an application of the method, the weights 
shown in Table 1 were computed for various sized samples selected 
from a ¢-distribution with varying degrees of kurtosis. Hence, if one 
knows that the parent distribution is approximately of the same shape 
as some form of the ¢-distribution, and if the approximate kurtosis of 
this distribution is also known, the mode can be estimated from a sam- 
ple of size 10 or less by arranging the sample values in order of size and 
then applying the weights shown in the table. If the kurtosis is 3, indi- 
cating a normal distribution, the weights are all equal, as they should 
be. If the kurtosis is large, the median is weighted more heavily than 
the extreme observations. 

While the mode of a distribution is generally of less interest than the 
mean or even the median, there are many distributions like the t-dis- 
tributions where these location parameters coincide. The estimated lo- 
cation of the mode may also be useful in other situations where its re- 
lationship to other parameters like the mean or median is known. The 
mode is emphasized here merely because it is important in the pro- 
posed method for developing the weights. 
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, TABLE 1 
WEIGHTS FOR APPROXIMATING THE MODE OF A 
sta~ t-DISTRIBUTION 
PE), 
the Size Rank 
SQ = 2 
me. of i Kurtosis:* a4 = ys/p2 
ing Sam- Sam- 
om ple ple 3 3.5 + 4.5 5 6 9 2 
3 2 .84 4.86 .86 #188 .88 ##.40 .40 ~~ .42 
ing 13 .33 .82 .82 .31 #81 «.30 8©.30~©6.29 
ars 
Yb- 4 2,3 .2 .26 .28 .28 4.29 .29 © .80.~ «31 
th 14 .25 .24 22 .22 21 21 © .20. 119 
nal 5 3.20 .22 .22 .24 124 124 8 ©.26 28 
he 2,4 .20. .21 22 .22 22 .23 .23 ~~) = .24 
in, 1s 2 6 7m 6 6 6 as 
or 
“a 3,4 .17 .18 .19 20 .20 .21 122 ~~ .23 
, 6 25  .17 117 118 .18 118 18 18 §©.18 
ult 16 .16 .15 18 .12 .12 .11 © ©.10 ©§©.09 
en 
de 4 14 1.16 16 6.18 §©.18 .18 .20 8.20 
of 3,5  .15 .16 .16 117 117 118 = =©.18 ~§©.19 
i 7 26 .14 1.14 «1.15 114 1.15 114 114) 114 
17 .14 1.12 .11 6.10 86.09 0 ©=.09) = .08—Ss(«w07 
ts 4,5 13 14 15 15 .16 .16 .17 18 
d 8 3,6 .18 .14 1.14 115 115 1.15 115 3.16 
“ 27 ££ 8 2 se.sese 2 © Al 
nf 1,8 .12 .10 .09 4.08 .07 .07 .06 8.05 
of 5 12 .12 114 114 (14 ©1140 ©1160 ~)©6©.16 
\- 9 46 11 .12 118 114 114 114 8.15 ~= .16 
d 3,7 11 18 .12 .12 .13 .13 .13 .13 
7 ae ae ee ee oe oe ae 
7} 19 09 .07 1.07 4.06 .06 .04 8.04 
? &6 .10 4.11 86.8 08668) (OS lhl 
47 .10 .12 112 1120 ©6©6.120©«©6618)=618Ss«id 
2 10 38 .10 .10 tk .tl ML) Ll) WO 
" 29 10 .10 .09 4.09 .09 ©.08 08 8 .07 
1160 .10 .08 .06 .06 .05 4.05 .04 ~»&.08 


se 





. : * The column headings are the values of a, for the parent distribution, not for the sampie. 
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DERIVATION OF THE WEIGHTS 


For some infinite population, assume that the distribution of a meas- 
urement z is described by the probability density function 


(1) y = f(z ti 61, 62, O3,°°° ? Ox), 


where 6;, 02, - - + , & are parameters of the distribution, the form of the 
function being such that @, is a location parameter. Assume, for every 
admissible value of z where y>0, that the derivative y’=dy/dz exists, 
that y has a unique mode at x=4,, and that y’/y is analytic. Let y; and 
y;’ respectively, denote the value of y and its first derivative at the 
point z=2,; and let 


(2) o(z:) = — yi'/Yi, 
(3) ¢’(xi) = do(x;)/dz,. 
Assume that for every pair z; and 2;, the ratio 


(4) Ris = $'(x:)/$'(2)) 


for 6;=0 is known; that is, the value of R;; does not involve an unknown 
parameter. Then weights for approximating the mode can be developed 
as follows. 

Let 


(5) Yr = f(r — 61, 2, +++ , Ox) 
where 7; is the rth order statistic in a random sample of size n from dis- 


tribution (1). The maximum likelihood estimate of 6; when 42, 3, - >: , 
6, are known is, in general, a solution of the equation 


a] 
6) Sp, 21 low (ae — Oy Ory ++ 5 Oe) = 0, r=1,2,---,7, 
O71 


where >, denotes summation over all n values of r. From the juxta- 
positional relationship between z and 4, in (5), 


) d 
— log f(t, — 91, O2,°++, 9 - log f(t, — 61, 02, +++ , %) 
(7) 06, gf( 1, % ie) an, g f( 1, [2 k 


- $(2r), 


where ¢(2,;.) is defined by (2). Combining (6) and (7), we obtain 


(8) DX (27) = 0. 


Let 9; be a value of 6, that satisfies (8). Expanding ¢(6;) in the neigh- 
borhood of 2, yields 
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(9) (0) = (te) + (6: — 2r)b’(2e) + 4G — 2r)°6"(ar) +++, 


where ¢’(zr), $’’(z,), . . . denote successive derivatives with respect to 
z,. Neglecting all but the first two terms of the expansion (which pro- 
cedure may introduce considerable error when §;—2; is large), and 
summing over the n observations, we obtain the approximate relation- 
ship, 


(10) nob) ~ >> o(2r) + Do (0: — arb" (ar), 


or after combining with (8), 
(11) no(bi) ~ D2 (8 — 22)b"(2r). 


Now suppose that 6,=6;, the mode of distribution (1). (In general, this 
supposition will be true only to a first approximation.) Then from (2), 


(12) ¢(8:) = 0. 
Combining (11) and (12) leads to 

(13) 0 = Do wre 
where 

(14) wr = o'(x-)/ D1 o'(ar). 


For a normal distribution, ¢’(z,) = 1/0? for every z,; hence, w,-=1/n 
for this case, which is as it should be. In general, however, ¢’(x) varies 
with z; and in order to make use of (14) to compute a linear estimate 
of the mode, we must choose some value of ¢’(z,)—say ¢’(¥,)—that is 
typical for the order statistic z,. In most applications, perhaps the sim- 
plest choice is the one where *, is the mode of the distribution of z, in 
a sample of size n from a parent distribution of the specified form with 
the mode chosen as the origin so that 6; does not appear in the general 
expression for w,. This value of z, can usually be determined or approxi- 
mated when the form of the parent distribution is known. 

Let F(x) denote the cumulative probability distribution of x (that 
is, the probability that a random selection from distribution (1) will 
be equal to or less than a particular value of x). Then the distribution 
of the order statistic x, (where 2,S2,41) may be written as 


(15) "__ tre) if — Fa) 
Le) | — Zr) |\"-Ty,. 
(—-Din—n! " 
To find ¥,, the mode of this distribution, we set the derivative equal to 
zero. After dividing by 
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n! 
(r—lln-—r 





y! [F (2+) [1 — F(x) }>*y-*, 


we obtain the equation 


r-—l n-?T ’ 
+= = 0. 


(16) F(t.)  1—F(a%) ye 





This equation can generally be solved for z,, for r=1, 2, -- - ,n, by an 
iterative procedure. 
COMPUTING THE WEIGHTS FOR THE {-DISTRIBUTION 


The probability density function for the ¢-distribution may be writ- 
ten in the form 





k+1 
r( 2 ) (t nd 6) —(k+1)/2 
(17) y= [2 + 


(Qe 

) Tv 

where k is the number of degrees of freedom. Since the mode is at 
t=6, we set 0=0 and obtain the functions 


(k + 1)t 
18 y Se ee oT ae 
(18) y rarer, 
(k + 1)t- 
k+t? 
(k + 1)(k — t,?) 
(k + t,?)? 

From (14) and (20), the desired weights are seen to be proportional to 
(k—#,?)/(k+4,?)2, where #, is a typical value of t,. 

Let us first consider the case where k=1. After dividing by 7, 
equation (16) for this case becomes equal to 


(19) o(t,) = 


(20) $'(tr) = 





| jus 
(21) r n—-?T 








_ — 2t.=0 
$47 + arc tan t, 3m — arc tan ¢, , 


where the angles are measured in radians. For n=5, solving (21) for 
1, 2,---, 5 successively by an iterative procedure and employing 
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(20) and (14) to find the corresponding weights, we obtain the follow- 
ing approximations: 








v 


t, ¢'(E,) Wr 





— .8738 1521 .03 
— .3689 1.3387 27 
-0000 2.0000 -40 
-3689 1.3387 .27 
.8738 1521 .03 





The weights shown in the last column indicate that in a sample con- 
sisting of five observations from this distribution, the sample median 
should receive the greatest weight (.40) and the two extreme terms very 
little weight (.03) in estimating the mode. 

Suppose we make the transformation 





(22) k= 2p, — V2p(1 a x)/z, 
for t>0. Then for x=2,, (20) reduces to 


2 1 
(23) o' (Zr) = : + 2,(22, — 1). 
2p 


The weights should therefore be proportional to #,(2¥%,—1), where %, 
is the value of x corresponding to the mode of the distribution of ¢,. 
Since p and z have the same meaning here as in Pearson’s Tables of the 
Incomplete Beta Function, we write 


(24) F(t) = 1 — $1(p, .5), 


1 dy- J/1 — 2 
25 -= — (2p + 1) ——— B(p, .5), 
(25) a? dh (2p + 1) Me (p, -5) 


where I(p, .5) and B(p, .5) also have the same meaning as in Pearson’s 
tables. Equation (16) can therefore be written as 


r—1 n-—f?r (2 +p) t= 5) =0 
1—-H@,5) H,5) lait: 
and approximate solutions for r=1, 2, - - - >4n can be found by inter- 
polating in those tables. These solutions can be taken as the values of 
zt, to be substituted in equation (23) in computing ¢’(z,) and w,. 


(26) 
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The procedure just described was used in arriving at the weights 
shown in Table 1. The kurtosis shown in the column headings is re- 
lated to the number of degrees of freedom by the equation, 


(27) a, = 3(k — 2)/(k — 4), for k > 4, 


COMPUTING THE WEIGHTS FOR A POLYNOMIAL DISTRIBUTION 


The properties of the estimates discussed in this article have not been 
investigated for the general case. Investigation for cases where the dis- 
tribution is a polynomial in z is relatively easy, however, provided the 
sample size and the degree of the polynomial are small. The following 
example shows what happens in a particular situation of this kind 
where the distribution is skewed. 

Consider the distribution of the simple form 


(28) z= 12u°(1 — wu), 0OsuSsil, 


which is a special case for the derivative of the Incomplete Beta Func- 

tion. We first note that the mode is at u= 3. Distributions of this same 

general form may therefore be obtained by making the transformation 
= 6; 2 


(29) u= rn 3 





Since du/dz=1/@:, equation (28) becomes 








mio tom, 
6. 62 3 3 62 
with z satisfying the inequality 
(31) er eee 
3 3 


This distribution has its mode at x=6,, and is of the form prescribed 
in discussing distribution (1). 

After setting 6,;=0 and 6.=1, we may proceed as previously suggested 
and write down the relationships! 


(32) Y, = - (1 — 32,)(2 + 32,)?, 





1 The computations can be carried out without assigning the arbitrary value of i to the scale 
parameter, 6:. The effect will be to multiply the resulting values of ¢’(Z,) by the reciprocal of 6:°. 
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(33) yr’ = — 122,(2 + 32,), 


272, 
(1 — 32,)(2 + 32) 
27(2 + 92,?) 
(1 — 32,)*(2 + 32,)? 





— yr’ /Yr = 


(34) (Zr) 





(35) $' (ar) = 


From (14), the desired weights are to be proportional to (35) for r=1, 
2,:++,m, where 2, is a typical value of the rth order statistic. 

Suppose the sample is of size 3. Employing (16), we may find the 
modal values of 21, 22, and x3 from the relationship 








r-l 3-r 243%, 
(36) - - = 0, 
where 
2 1 
(37) i 4 £, < ee 
3 3 
and 
a 1 
(38) F(%,) = f yidz, = — (2 — 34,)(2 + 3%). 
2/3 2% 
We may also write 
1 
(39) 1 — F(%,) = a (1 — 3#,)2(11 + 18%, + 94,7). 


Putting r=1, 2, and 3 successively in equation (36) and clearing of 
fractions, we obtain the equations 


(40) 297%,2 + 594%,2 + 387%, + 64 = 0, 
(41) 891%! + 1188%,3 — 27%,2 — 582%, — 20 = 0, 
(42) 99%;? — 66%; + 8 = 0. 


The roots of these equations satisfying (37) have the approximate 
values 


(43) #, = — .2481851535, 
(44) H = — 0345011424, 
(45) #3 = .1592556774. 
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Substituting these values for z, in (35), we obtain the approximations 


(46) $¢'(4:) = 14.37744535, 
(47) '(#s) = 12.39545132, 
(48) '(#s) = 35.9319163. 


Using these values in (14), we find the approximate values of the weights 
to be as follows: 


(49) w, = .229287748, 
(50) ws = .197679424, 
(51) ws = .573032828. 


In using these weights, we are likely to be especially interested in 
three properties of the resulting estimates of the mode, these proper- 
ties being defined by the following relationships: 


(52) Bias = E[6,] — 41, 
(53) Variance = E[6,?] — (E[6:])?2, 
Mean Square Error = E[(6, — 6)?] 


(54) ; , 
= (Bias)? + Variance, 


where the symbol EF denotes average or expected value. For this par- 
ticular case where the distribution is described by equation (30), these 
properties when the weights in (49), (50), and (51) are used to estimate 
the mode are indicated in Table 2 in the column headed “‘Maximum 
likelihood’ estimates.” For purposes of comparison, the same proper- 
ties are shown for two other weight systems, the second system being 
that which yields the “best unbiased estimates”—that is, the unbiased 
linear estimates with minimum variance—and the third being the sys- 
tem which yields the linear estimates with minimum mean square er- 
ror.? The methods of deriving these weight systems and their proper- 
ties are outlined in the appendix that follows. 

It should be kept in mind that the comparison shown in Table 2 ap- 
plies only to distributions of the form described by equation (30), and 

2 It is assumed here that attention is restricted to linear estimates for which the sum of the weights 
is equal to one, and that there is no a priori information about the size of the parameters (aside from the 
implied specification that 6: is positive). Under other conditions, the weights shown in the last two 
columns of Table 2 may not be optimum with respect to the properties indicated in the column headings. 


For example, if @; is known to satisfy the inequality 
—.000000001 <é@: <.000000001 





and 6; is known to be large, say 

6: >100,000, 
then the estimates obtained by employing the weights shown in the last column of the table would not 
be as good as estimates obtained by setting all the weights equal to sero. 
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may not be indicative of the relative merits of different methods of 
estimating the mode for other types of distribution. 

TABLE 2 


COMPARISON OF THREE LINEAR SYSTEMATIC ESTIMATES 
OF THE MODE OF A POLYNOMIAL DISTRIBUTION* 











“Maximum “Best” unbiased Minimum mean 
likelihood” prin square error 
estimates estimate 
Weights 
w1 . 229287748 . 232616040 . 241695273 
We - 197679424 . 133288817 . 134304834 
Ws . 573032828 .634095143 .623999893 
Bias — .0092025370, 0 — .0032760926. 
Variance .012464878¢,? .0124789160,? .01245491 40,2 
Mean square error .0125495640,? -0124789166,? .0124656476,2 





* The values shown are for the particular polynomial distribution described by equation (30). The 
variance of this distribution is .040,?. 


APPENDIX 

Lower moments of the order statistics 

To investigate the properties of various linear systematic estimates 
of the mode of the distribution described by equation (30), we first 
derive the lower moments of the order statistics. Consider the distri- 
bution described by equation (28). For a sample of size three from this 
distribution, the moments and product moments of the order statistics 
may be obtained from the general formula 


1 ug ug 
(55) E[us*ue!us*] = 3! f f f Uy ug Ug*yiYyoy sduiduedus 
g=0 HY ug=0 Y uy =0 


u 


where 
(56) y= 12u,?(1 ead ui), i= ) 2; 3. 
It turns out that 
(57) EI 2127 EI 3039 Blus] 3843 
U =_l eo =e Uu =_l eee 
5005. ma) 5005" *1 5005” 
1028 1948 3030 
58 E 7) = — E 2) = —— E 2; = ———-.. 
(58) [us ] 5005 [we ] 5005- [us | 5005 
6777 333 477 
59 E 5 <eemeeaes 9 E soe) E =e 
(59) [mien] = 3096 [uaa] = 001 [meus] = soon 
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The lower moments of order statistics for samples of size three from the 
distribution described by equation (30) can now be obtained by direct 


substitution of the right side of (29) for u in 


(57), (58), and (59), since 


there is a linear relationship between u and z. Thus, we obtain 














(60) E[x] = a oe =, 62, 
15015 
E[z:] =6,— Read 62, 
15015 
15015 
(61) E [x,?] = 6? — boon 6162 ~ 627, 
15015 45045 
E[z.?] = 6,2 — Rod 610. = 622, 
15015 45045 
ses) = 09 + a ans 174 4, 
15015 45045 
(62) E[2,22] = 6,2 — 4522 60. 6113 622, 
15015 225225 
E[azs] = 6;? = = 0102 — _ 622, 
3003 9009 
E [22251 > 6,? +- =. 6:02 ot. = 6,7. 
15015 45045 


From these values, the variances and covariances of the order statistics 


are found to be 





(63) ot = Elz) — (B[x,))* = — 2 0, 
25 050,025 

opwthet~ eee ~ Sa 
25 ,050 ,025 

os” = E[xs*] — (E[z])? = al 02, 


25 ,050 ,025 








i the 
rect 
ince 


tics 








APPROXIMATING THE MODE FROM WEIGHTED SAMPLE VALUES 125 
319 ,824 
= E\a2.|) — E\x,|E\z2| = ——————- 0? 
(64) Tia [x22] [x1] E [ze] 25,050,025 
159 , 264 
= E\x23| — E|ajE\|rz3| = ————__ 0”, 
ou = Elzits] — Eln|E [a] 25,050,025 — 
258 ,048 
=F — Elas|E = ——_—_—__ 9’, 
ou = Elevate] — Elta]E las] = ons 
Lower moments of linear estimates of the mode 
A linear function 
(65) 0; = wits + wet, + wets 


of the order statistics x;, 72, x3 in a sample of size three from the distri- 
bution described by equation (30) has the mean value 


(66) E[@,] = w.E [21] + weE [22] + wsE [zs]. 
Its variance is 
E[6,] —(E [8,])2 = W203? + wWe2ae? + ws? 
+ 2(wiweer + wiwse1s + WeWs023). 


Combining (66) with (60), and (67) with (63) and (64), and employing 
the weights in (49), (50), and (51), we obtain 


(68) E[6,] = 6, — .0092025376., 
(69) E[6,2] — (E[6,])? = .0124648780,. 


The bias , obtained bysubtracting#from (68), istherefore — .00920253702, 
while the variance is given by (69). Adding the me of the bias to 
the variance, we obtain 


(70) E[@: — 6:)2] = .0125495640,2, 


(67) 


which is the mean square error of the estimates. These results are 
shown in Table 2 in the column headed “ ‘Maximum likelihood’ esti- 
mates.” 

To yield unbiased estimates of the mode of distribution (30) for all 
values of 6; and 6, the weights w:, we, ws must satisfy the condition 


(71) wiz [x1] ft wok [x2] + w3E [xs] = A; 


for every 6, and 62. Combining (71) with (60) leads to 
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( 3629 ») m ( 893 ) 
w _ w —- —— 
15015 "A" 15015 


“i ( m 1519 ») P 
w = hi, 
; *\ " 16015 , 


which must be satisfied identically. Hence, the weights must satisfy 
the two conditions 

(73) Wi + we + Ws; = 1, 

(74) 3629w, + 893w. — 1519ws = 0. 


Combining (67) with (63) and (64), the variance of any linear function 
of the order statistics 2, 22, and 23 is found to be 


(72) 


6 2 
on ee {621,011w,2 + 514,219w.? + 396 ,501ws? 


+ 2(319 ,824wiws + 159, 264wiws) + 258 ,048w2ws) }. 


To find the regular unbiased linear estimate with minimum variance, 
we write down the expression 


621 ,O11w;? + 514,219w.? + 396 ,501ws? 
(76) + 2(319 ,824w:22 + 159 ,264wiws + 258 ,048wews) 
+ Ay(wi + we + wz — 1) + ro(3629w, + 893w, — 1519w;), 
where \; and d, are Lagrange multipliers, and set its partial derivatives 


with respect to w;, We, w3 each equal to zero. After eliminating \; and 
de, we obtain the equation 


(77) 613w, — 2497w2 + 300w; = 0. 


Solving (73), (74), and (77) simultaneously for w, we, and ws yields 
(78) w, = 3,525,043/15, 153 ,912, 
(79) we = 2,019, 847/15, 153,912, 
(80) w3 = 9,609 022/15, 153 ,912. 


The variance and mean square error of the estimates computed with 
these weights is shown in Table 2 in the column headed “ ‘Best’ un- 
biased estimates.” 

The mean square error of the estimates considered here may be found 
directly by setting 6, =0 in equations (61) and (62) and then computing 
the value of the expression 
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w2E [22] + we2E [x22] + ws2E [252] + 2(wwek [2122] 
(61) + wwsE [x23] + wsE [x22]), 


which is now equivalent to 


6 2 
(82) 225 a (18, 740w,? + 5,420u,? + 5,870w;? 


+ 12,226wiw2 — 8,150wiws + 1,930wews). 
The regular linear estimate with minimum mean square error may be 
found by writing down the expression 
18 ,740w,? + 5,420w2? + 5,870ws? + 12,226wiwe 
— 8,150wws; + 1,930wsws + A(wi + we + ws — 1), 


and then setting the partial derivatives with respect to w;, we, and ws 
each equal to zero. After eliminating A, we obtain the equations 


(84) 12,627, + 693w. — 5,040w; = 0, 

(85) 10,188w, + 4,455w, — 4,905w; = 0. 

Solving these two equations and (73) simultaneously leads to 

(86) = 18,095/74 867, 

(87) = 10,055/74,867, 

(88) = 46,717/74,867. 

The bias, variance, and mean square error of the estimates obtained 
from these weights are shown in the last column of Table 2. The weights 


for all three types of estimates covered by the table are shown there in 
decimal form to permit ready comparison. 


(83) 
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ESTIMATING THE RATIO BETWEEN THE PROPORTIONS 
OF TWO CLASSES WHEN ONE IS A SUB-CLASS OF 
THE OTHER 


Jack M. ELKIN 
U. S. Railroad Retirement Board 


1. Consider a population of size N, of which p,N items have the 
characteristic A, of which, in turn, p2N also have the characteristic 
B; ie., the proportion of A-items which are also B-items is p2/p,. 
For example, p; may be the proportion of individuals eligible to retire 
among a group of individuals covered by a retirement program at the 
beginning of the year and p:/p; the proportion of eligibles who retire 
during the year, or p: may be the proportion of families in a given in- 
come class among a group of families in all income classes and p2/p, 
the proportion of families in the specified income class who own their 
own homes. A straightforward problem is one of estimating p2/p, 
from a simple random sample of size n drawn from N. The problem as 
stated in the title is, of course, equivalent to one of estimating the ratio 
of the overlap between two overlapping classes to one of the classes. 

Sometimes the actual value of p: or pz is known beforehand. On the 
“theory” that it is always better to use a known than an estimated 
datum, one might be tempted to estimate p2/pi by estimating the 
unknown proportion from the sample and combining that estimate with 
the known proportion. This, however, is not necessarily the best course 
to follow. If we consider the “better” of two estimates to be the one 
with the smaller coefficient of variation, it can be shown, for a suf- 
ficiently large sample, that 

(a) even if p2 is known, a better estimate of p2/p: (or of p:/p2) 
can be obtained by estimating both proportions from the sample if 
p2>pi/(2—p1), or, alternatively, if p1<2p2/(1+p2), and 

(b) even if p; is known, a better estimate of po/p: (or of p:/p2) 
can be obtained by estimating both proportions from the sample, 
regardless of the values of p: and p>. 

The criterion in (a) cannot be applied exactly in practice, since 7; is 
not known, but its sample value can almost always be satisfactorily 
substituted for the universe value in the inequalities. 

By way of illustration, suppose that, on the basis of a simple random 
sample taken of 100,000 employees covered by a pension program at 
the beginning of a year, it is estimated that 5,000 were eligible to retire 
during the year and that 3,000 of these did retire. Suppose, further, 
that the exact number of those who retired is known, from another 
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source, to be 3,200. Since .03 > .05/(2—.05), the number of retirements 
as a proportion of the number eligible for retirement should be quoted 
as ¢ or 60 per cent, rather than 3.2/5 or 64 per cent. On the other 
hand, suppose that the sample estimate of the number who retired is 
2,000, but that the exact number is known to be 2,100. Since .02 | 
<.05/(2—.05), the proportion should be quoted as 2.1/5 or 42 per 
cent rather than ? or 40 per cent. If it is the number eligible to retire 
that happens to be known exactly from another source, it should 
nevertheless be disregarded, and the proportion should be calculated 
from the sample numbers in all cases. 

The conclusions stated in (a) and (b) above follow easily from a com- 
parison of the coefficients of variation of ps'/pi', p2/pi’, and p2'/pi 
(or of their reciprocals), the dashes being used to indicate sample 
values. The expected value, variance, and coefficient of variation of 
p'/pi' are given, respectively, by 


z(? =) _ 2 

pi Pi 

v(=) _ N—1 prlpi — Pr) 
pi Nn pi* 


pe’ 2 N-n/1l 1 
enfiynBana 2 
Pi Nn P2 Pi 

where the approximations assume that n is sufficiently large.’ If the 


reciprocal of p:’/p,’ is expanded in a Taylor series in the neighborhood 
of the expected value of p2'/py’, it can be seen that, for sufficiently large 


n, 
sft }=” 
poi pa 
v(=) .N— piri — P) 
D2’ Nn p2* 


. ‘\ 7? \]%: N—-n/fl 1 
[o-v. (5) = [e-7.(F2)] * GaGa: 
D2’ py Nn \p2 pi 


Likewise, if the Taylor series expansion is applied to 1/py’, it follows 
from the well known expressions for E(p;’) and V(p1’) that 




















1 W. E. Deming, Some Theory of Sampling, pp. 452-54, quoting a proof by F. F. Stephan. Stephan’s 
proof excludes the possibility of a sero value for the denominator proportion. 
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1 1 
(+3 
pi Pi 


v(—) += et. 
py Nn pi® 


ex (2) [or 2S) 


For completeness, we have the familiar formula, 


[C.V.(p2’) ? = [e-v. (= )| a “= *(- ” 1). 


The approximate formulas for the coefficients of variation are asymp- 
totic expressions and so are the efficiency comparisons based on them. 

2. A useful application of these results can often be made, even 
when the parameter under investigation is simply the proportion of 
cases in the universe possessing a specified attribute.? Thus, suppose 
the proportion (pi) of A-items is desired, the proportion (ps) of B- 
items is known, the B-items are a sub-class of the A-items, and 
p1>pi /(2—py), or, alternatively, p:’<2p2/(1+p2), (where py’ is the 
sample value for p,). Under these conditions, it is preferable to obtain 
the sample value (p2’) of pz from the same sample (assuming the 
B-items can conveniently be counted along with the A-items), and 
use 71'p2/p2' instead of p;’ as the estimate of p;. The advantage of this 
procedure can be measured by the ratio 


v. (=) 
D2! i [ Pi — Peo ie 
C.V.(py’) p(l—p)t - 
For example, if p1=.2 and pe=.15, C.V.(pi'p2/p2’) =.6.C.V. (p7’). 
Similarly, suppose the proportion (p2) of B-items is desired, the pro- 
portion (p1) of A-items is known, and the B-items are a sub-class of 
the A-items. It is preferable to obtain the sample value (p;’) of p: 
from the same sample (assuming the A-items can conveniently be 


counted along with the B-items), and use p2'p:/p,’ instead of pz’ as 
the estimate of ps. The advantage of this procedure can be measured 


by the ratio : 
CY. (? . ) 
pr’ 


mm [ Di — Pe in 
C.V.(p2’) pil — pod - 
For example, if p;=.2 and p2=.15, C.V.(pa'pi/p1’) =.5 C.V. (py’). 


2 See also W. E. Deming, op. cit., pp. 165 ff. 
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THE CIRCULAR NORMAL DISTRIBUTION: 
THEORY AND TABLES 


E. J. GuMBEL 
Consultant, Stanford University 


J. AkTHUR GREENWOOD 
Manhattan Life Insurance Company 


Davip DurRAND 
National Bureau of Economic Research 


0. PROBLEM 


HE recognition of periodicities, their amplitude and length, is one 
Te the main tasks of the application of statistics to long-range data 
and especially to economic phenomena. Although numerous theorems 
have been developed, the question of how to recognize a cycle remains 
open, since the complicated mathematical methods used in the peri- 
odogram and correlogram analysis may generate both real and fac- 
titious cycles and may conceal real ones. 

Therefore, the study of cycles is a fertile field for nonscientific pro- 
cedures. Since even a constant value in a certain interval can easily be 
reproduced by a Fourier series, cycles with superimposed epicycles and 
third and fourth cycles have been “found” in nearly every domain that 
may be represented by numbers. Different observers have found differ- 
ent cycles in the same series—not to mention the great cycles of history 
leading to forecasts, good advices and policies advocated by their 
prophets. This abuse has discredited the whole theory of cycles. 

In contrast to these procedures, we examine here a specialized dis- 
tribution function. Applications to recurrent phenomena of known 
period will be given in a second article. Since the distribution function 
has only two disposable parameters, and since we consider using it 
only in connection with such phenomena as seasonal or diurnal varia- 
tions—where the length of period is definitely established from non- 
statistical considerations—we feel that the chance of generating false 
cycles is small indeed. 

The distribution function under discussion was tabulated because 
it possesses important theoretical properties of the linear normal dis- 
tribution and because it has practical applications to such meteoric 
phenomena as the direction of the wind, which is an angular or “cir- 
cular” variable. Wind directions are represented clockwise on the 
map from north to east, south, west, and back to north in the same 
Way as an angle varies from zero to 27. Each direction or angle has a 
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certain frequency and the sum of the frequencies is the number of ob- 
servations, diminished by the number of calm conditions. The question 
then arose of extending the use of this function to time series with 
seasonal, or possibly diurnal, components. Assume n events happen 
during a year, or a span of several years. These events might be storms, 
or deaths, or +he completion of automobiles. Each event happens at a 
certain date. If this date is considered as the chance variate a, the 
distribution of events over the year may be considered a circular dis- 
tribution of an angle. 


1, STATISTICS OF A CIRCULAR DISTRIBUTION 


We first consider the purely empirical procedures which are used 
especially by climatologists in order to characterize circular distribu- 
tions. These descriptive measures will prove basic for the construction 
of the theory and the estimation of the parameters. 

Let n points be situated in some fashion on the circumference of a 
unit circle: then a,(v=1, 2,---+,m) are the angles of the radii drawn 
from the center. To characterize such circular observations their vector 
mean has been used: this is clearly identical with the centroid of the 
set of n points. To calculate it we introduce the rectangular coordinates 
Z,=COS a,, ¥,=sin a,. Then 

i 2 a. 
(1.1) Z=—)ix, J=—Ly 
1 nm 1 
defines the vector mean. 

For our purposes it is, however, more convenient to express this vector 
mean in polar coordinates. Let @ and ao be a solution of the equations 
= cos ao, f= sin ao. It is well known that this solution is unique 
unless = 7=0, and is given by 


(1.2) a= (+ 592 
(1.3) ao = arctan — 
Z 


where the quadrant in which a» lies must be determined by inspection 
of the signs of and 7. Equation (1.2), which defines the vector strength 
a at the mean direction ao, may be written from (1.1) 


(1.2’) né = / ( > sin an) +:( > cos 2) 
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Finally the mean direction ap satisfies the equation 
(1.3’) > sin (a, — ao) = 0. 


The statistic ao is analogous to the average of a linear variate; the 
meaning of d will be explained later. 

The observations a, may be given individually or, and this is mostly 
the case, in grouped form, i.e. as frequencies p, attributed to certain 
intervals, say months, of equal length.' In the individual case, we cal- 
culate the sine and the cosine for each observation, and obtain a» and 
G from (1.2’) and (1.3). The grouped case permits a simplification. 
Assume that the frequencies for the 12 months are given, and assume 
furthermore months of thirty days each. Then one day corresponds to 
one degree. We attribute the frequencies to the midpoint of each month 
and assume, say, the middle of July as preliminary value for ao. Then 
the observations can be arranged in the following scheme: 


TABLE 1 
SCHEME FOR A CIRCULAR DISTRIBUTION OVER THE YEAR 








North 
360° 
330° June July August 30° 
300° May September 60° 
West 270° April October 90° East 
240° March November 120° 
210° February December 150° 
January 
180° 
South 





The mean direction is obtained from the following trigonometric rules: 


(1.4) >> cos a, =July —Jan. +0.86603 (Aug. — Feb. +-June— Dec.) 


ven] 


+0.5 (Sept. —March+ May —Nov.) 
(1.5) >> sin a,=Oct.—Apr.-+0.86603 (Sept. —March+Nov. — May) 


vom] 


+0.5 (Aug. —Feb.+ Dec. — June). 





1 Whether corrections for length of month are desirable in practical calculations will depend on the 
degree of refinement desired, and on the amplitude of the seasonal movements. When the seasonals are 
of the order of 10 per cent or less, variations in length of month will produce noticeable irregularities. 
February, after all, has 10 per cent fewer days than January. The details of the corrections will be 
given in the second article. 











134 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 10953 


Of course this procedure can also be applied to any other starting point 
after the appropriate rotations. 
A circular distribution may be graphically represented by a linear 
histogram. Numerous arrangements are possible, including: 
(1) Listing the months in their normal order from January on the left to 
December on the right; 
(2) Placing the modal month in the middle with one-half of the sixth follow- 


ing month at either end; 
(3) Locating the calculated mean (or mode) in the middle. 


A circular distribution may also be plotted on polar coordinate paper. 
Here the usual procedure is to trace the frequencies of the various 
months as radii vectores at the appropriate angles; the result is a 
polar wedge diagram corresponding to the linear histogram. By this 
procedure, however, the area of each wedge varies as the square of the 
frequency, which means that if two different distributions of, say, 100 
observations are plotted, their areas will in general differ: To conserve 
areas, Leighly [8] proposes to trace the square roots of the frequencies, 
and this procedure, henceforth called aequiareal, is used in the follow- 
ing. An obvious alternative is to use aequiareal polar paper, which 
may be constructed by graduating the radii according to a square root 
scale. 

Two limiting cases of circular distributions are immediately evident. 
The first is the uniform distribution, where any angle is as likely to 
occur as any other. Then the density of probability f(a) =} (a meas- 
ured in radians) is obviously independent of a. The cumulative prob- 
ability function—defined in analogy to the linear procedure by dF/da 
= f(a), with the boundary conditions F(0)=0 and F(2x7) = 1—is simply 
F(a) =a/2zx. Pélya used an n-dimensional analogue of this assumption 
when he investigated whether the stars are distributed at random over 
the celestial sphere—also when he calculated the mean distance be- 
tween two points for any number of dimensions [14, 16]. 

The other limiting case occurs when all the angles are the same. This, 
of course, has no practical or theoretical interest, but the asymptotic 
approach to this case is worth discussion. As anexample, aset of angles 
might be normally distributed with mean 0 and standard deviation 1°. 
Then no sensible fraction of the total frequency would be found in most 
of the circle—for example, the arc from 5° to 355° would_be virtually 
void. Asymptotic distributions of this type do not differ sensibly from 
the linear normal. They have been used for the analysis of the Brownian 
movement and the paths of the beta rays [6], and they are the basis 
for Schelling’s work on the most frequent particle paths in two and 
three dimensions [18]. 
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The problem of circular distributions becomes significant wien a sen- 
sible portion of the observations may occur over all or nearly all of the 
circumference. As in the linear case an infinity of circular distributions 
exists. Among them a few, possessing common desirable properties, 
have been studied. The distributions studied are all symmetrical, al- 
though many observed phenomena show asymmetry. Moreover, these 
distributions possess one mode—i.e., one most probable direction. From 
both properties—symmetry and unimodality—it follows that nowhere 
is the probability less than its value at the antipode of the mode— 


0° 


<— CARDIOID 








270° 7 9 


AEQUIAREAL 180° 


GrapH 1. The cardioid and the aequiareal trace of the cardioid distribution. 


henceforth called the antimode. Nothing is lost in generality if we put 
the mode at zero, north on the map, and the antimode at 7, south; 

A simple example of a circular distribution is the sine function modi- 
fied in such a way that the density of probability is non-negative and 
the area under the distribution is unity. This leads to the symmetrical 
distribution 


f(a) =$(1+ cos a). 


The density of probability is 1/ at the mode and zero at the antimode. 
When f(a) is traced in polar coordinates, the resulting curve is the 
cardioid. Graph 1 compares the cardioid trace of f(a) with the aequi- 
areal trace. 
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Other circular distributions can be generated by wrapping an un- 
limited linear distribution around a circumference. This problem has 
been studied by Lévy, Marcinkiewicz, and Wintner [9, 10, 20], who 
have thus treated the normal and the Cauchy distributions. The nor- 
mal wrapped-up distribution with mode zero may be written 





1 too 1 2 
(1.6) & 7 > en (atekr)*/2e* — 4 »» q” costa (0 Sa S 2r) 


V2 tm—o 2r T tml 


where Ing= —o*/2. Graph 2 shows it traced on aequiareal polar scale for 
o=50° and 100°, corresponding respectively to ¢=.68333383 and 
.21608421. When o>77° approximately, this angular distribution may 
be computed from tables of elliptic theta functions. Theoretical advan- 
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Grapx 2. Wintner’s circular distribution in aequiareal scale. 
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tages recommend this distribution; it presents, however, serious dif- 
ficulties, especially if the method of maximum likelihood is to be used 
to estimate its parameters. 


2. THE CIRCULAR NORMAL DISTRIBUTION 


Gauss has shown that the normal distribution can be derived by 
what is now called the principle of maximum likelihood and a single 
assumption that the mean is the most probable value. In 1918 Mises 
[11] applied the method of Gauss to a circular variate and derived the 
distribution that is the subject of this paper. His procedure is essen- 
tially as follows: Mises asks for such a distribution f(¢), where é is an 
“error of observation” ¢,=a,—a, that the ratio of the a posterior: to 
the a priori probability (the likelihood function) of a “true value” a 


upon 7 observations a1, - + - , a, is to be a maximum for that a given by 
equation (1.3’). The likelihood function is 

II sé). 

ven] 


The postulate means that 


¥ Lie — = 


yan] S(a iin ao) 


should hold together with equation (1.3’). Since these two sums are 
equal for arbitrary values of the a,, the equality must hold term by 
term. This functional equation has the solution 


I(&) = Coto ts 
where the two positive parameters C and k are linked by the condition 


0 


"s()dt = 1. 


0 


C=1 / f eomeas 


Pad 
1/221 o(k) 


where Jo(k) is the Bessel function of the first kind of pure imaginary 
argument. Therefore 


Consequently 


ek cos (a—ag) 


— fle) = 
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This expression is henceforth called the circular normal distribution 
since it was derived in a way which is strictly analogous to the classical 
derivation of the linear normal distribution. Pélya [15, pp. 129-132] 
has shown that the distribution (2.1) is the only one that shares certain 
important properties of the linear normal. 

The density function has a mode at ao and an antimode at ao+z. 
The quotient of the densities at the mode and at the antimode is 


(2.2) S(ao)/f(ao + 3) = e*. 


The larger the part of the distribution situated in the neighborhood of 
the mode, the larger is k. Therefore, this parameter k is a measure of 
concentration. For k=0 the distribution (2.1) degenerates into the 
uniform circular distribution. When & is significantly different from 
zero, there is evidence of the existence of a cycle. 

To get further insight into the nature of the parameter k consider 
the points of inflection for the distribution traced on a linear scale. Let 
the mode ap be zero, then the points of inflection a; and az are the 
solutions of 


f(a) =0 
whence from (2.1) 
k sin? a = cos a. 


The larger the parameter k the nearer are the points of inflection to 
the mode. The preceding equation has the real solutions 


(2.3) cos qy = —- — Et 14 ver 7a; = 1. 


For large values of k, the points of inflection are approximately 


(2.3’) as = + 1/Vk 
while for the linear normal distribution, the points of inflection are 
a2:= to. 


If k is large, its reciprocal 1/k influences the circular normal distribu- 
tion in the same way as o? influences the linear normal one. 

This analogy leads to the second degenerate case. Let k be large ; then 
the distribution is highly concentrated about ao. Introduce a reduced 
variate 8 by writing 
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a — ao = B/V/k. 
The distribution of the variate B is 
J(8) = const e* °° (GIVE) 


Since the argument of the cosine is small, we may neglect higher powers 
of 8 in the exponential and obtain a normal distribution 


1 
= —— fe 2/2 
(2.4) f(8) a , 


with mean zero and unit standard deviation. In fact, for large k the 
reduced distribution f(8) converges to this linear normal distribution. 
For the estimation of the parameters consider the likelihood function 


L = (2x)-*I9-*(k) exp ed cos (a, — a) |. 
1 
Equating 0/da9 log L to zero we obtain equation (1.3’). Equating 
0/dk log L to zero we obtain 
To’(k) 
Io(k) 


The expression on the right is easily transformed with the help of 


(1.3’) into 
1 n 2 " 2 
—4/ ( cos a.) + (Xsina) 
n 1 1 
which by (1.2’) equals the vector strength d. Therefore the parameter 


k is the solution of 
(2.5) I,(k) — aIo(k) = 0 





1 n 
= — >> cos (a, — ap). 
nmi 





where 
I,(k) = Io’(k). 


Equation (2.5) was solved for k by interpolating in a table of Bessel 
functions [16a]. Table 2 gives k as a function of the observed vector 
strength ad. Tables for testing the significance of observed @ are under 
preparation. 

K. Arnold [1] has generalized the distribution (2.1) to the bivariate 
case, considering the center of gravity of points on a sphere. He has 
also generalized equation (1.6) to the bivariate case by using Fourier’s 
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TABLE 2 
PARAMETER k AS A FUNCTION OF VECTOR STRENGTH 4 
Ii(k) 
a= 
To(k) 
a k 63 a k 33 
00 0.00000 ok 45 1.01022 +52 
01 .02000 1 46 03889 54 
02 .04001 1 47 06810 57 
03 .06003 1 48 09788 62 
64 .08006 4 49 12828 64 
05 -10013 2 50 15932 69 
06 .12022 3 51 19105 72 
07 . 14034 5 52 22350 77 
08 .16051 5 53 25672 83 
09 18073 6 54 29077 88 
10 0.20101 5 55 1.32570 93 
11 .22134 8 56 36156 100 
12 24175 7 57 39842 107 
13 . 26223 8 58 43635 115 
14 -28279 9 59 47543 123 
15 .30344 10 60 51574 133 
16 .32419 9 61 .55738 142 
17 .34503 12 62 .60044 156 
18 36599 12 .63 -64506 166 
19 .38707 13 .64 69134 183 
20 0.40828 13 .65 1.73945 197 
21 42962 14 .66 78953 216 
22 45110 15 .67 84177 236 
23 47273 17 .68 89637 260 
24 49453 16 .69 95357 285* 
25 51649 18 .70 2.01363 315* 
26 53863 20 71 07685 351* 
27 56097 19 oa 14359 391* 
28 58350 22 Py f° 21425 437* 
29 60625 22 .74 28930 493* 
30 0.62922 23 45 2.36930 558* 
31 65242 25 .76 45490 633* 
32 67587 26 oan 54686 728* 
33 69958 27 .78 64613 837* 
34 72356 29 .79 75382 972* 
35 74783 31 .80 .87129 1136* 
36 77241 31 81 3.00020 1341* 
37 79730 34 .82 . 14262 1598* 
38 82253 36 .83 30114 1918* 
39 84812 37 . 84 47901 2331* 
-40 0.87408 39 .85 3.68041 2860* 
41 .90043 42 .86 .91072 3557* 
42 .92720 43 87 4.17703 4480* 
43 95440 47 
44 . 98207 48 
45 1.01022 52 





®8 has been modified from .69 to .87. 








953 


THE CIRCULAR NORMAL DISTRIBUTION 141 











180 


Grapx 3. The circular normal distribution in aequiareal scale for k =0, 1, 2, 4. 


equation of heat flow, which yields the normal ‘inear equation as a 
special case. 


38. CALCULATION OF TABLES 


The density function (2.1) is relatively simple to calculate from tables 
of the Bessel function, the cosine, and the exponential function. The 
density function is presented to three decimals in the form 


(3.1) ¥(a) = Vek os #/To(k) 


which is adequate for tracing aequiareal frequency ovals on polar 
paper (Table 3). This table was calculated to five decimals and checked 
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by differencing before cutting down to three decimals. It was further 
checked by multiplying together the values corresponding to supple- 
mentary angles. For tracing, the values given in Table 3 must be mul- 
tiplied by »/n/m, where m stands for the number of wedges—in our 
case m= 12. 

Graph 3 traced in aequiareal polar scale shows the influence of the 
parameter k. For increasing values of k the area from 90 to 270 degrees 
shrinks to an insignificant part of the whole area. For the sake of com- 
parison the distributions are also drawn in linear scale in Graph 4. 

The areas of the circular normal distribution function, Table 4, 
posed the main problem of calculation. Originally it was desired to 
compute this table to 7 decimals for k=0.1 (0.1) 4.0 and a=2%° (23°) 


40 





PROBABILITY DENSITY 














heee*. i rt 


00 a L 1 
180° 210° 240° 270° 300° 330° 0° 30° 60° 390 





Grapu 4, The circular normal distribution in linear scale for k =0, 1, 2, 4. 
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180°. This was actually done, and the present table is an abridgment. 
For calculation three methods were considered: quadratures, expan- 
sion in a series of incomplete beta functions, and expansion in a series 
of sines. The sine expansion was finally accepted—partly because it 
was suitable for use with an I.B.M. 602 calculating punch available 
to the authors, and partly because adequate tables of the Bessel func- 
tions I,(k) were obtainable. 

The method of quadratures was suggested in 1950 by Dr. H. R. J. 
Grosch, then of the Watson Scientific Computing Laboratory, Colum- 
bia University. The method has the advantage that the complete in- 
tegral, fore e°* = dz, is obtained without reference to its definition as a 
Bessel function, and the Bessel function is then used as a check. This 
method would be suitable for longhand calculation, and it seemed 
ideally suited to calculation on the card programmed calculator at the 
Watson Laboratory—a possibility briefly considered. The method was 
not, however, well adapted to calculation on the 602 calculating punch. 

The method of incomplete beta functions derives from Mises [11, p. 
499], who gives the formula 


(3.2) f ek este = Y) — cos” zdx 

¢ n=Q 1! 0 
and indicates that it was used for computation by S. Weich. Arnold has 
likewise stated, in conversation with the authors, that he used a formu- 
la equivalent hereto in computing his 6 decimal table [1]. The integral 
on the right side of (3.2) is recognized as the incomplete beta function 


1 , 
= f t-1/2(1 — t)@-Y2 dt, where v = sin? z. 
0 


To evaluate it there are available, besides expansion of the integrand 
as a polynomial in cos mz: 

(1) The tables of Legendre [7, p. 178] available only for n=2, 4, 6. 

(2) The table of Soper [12, p. 179], for z=2/2. This was used in 
preliminary calculations. 

(3) The table of Wishart [12, p. 239]. This was investigated but its 
auxiliary argument, which we shall call ¢, defined by 


x 
£ = 2n'/2 tan —» 
2 


renders it unsuitable for extended calculations for fixed x; moreover it 
fails for n<9. 
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(4) The extensive beta-function table of Pearson [13]. This requires 
the auxiliary variable 2=cos* z, rendering interpolation everywhere 
necessary and requiring, for moderately small z, some such artifice as 
that of Pearson and Stoessiger [12, p. 176]. 

The method of sine series rests on the expansion 


(3.3) ekeoss = To(k) +2 >, 1,(k) cos nz 

n=1 
which was essentially given by Frullani [4, p. 502]. Integration term by 
term gives 


ek cos tdzr 





y ) 
n—I,(k) sin na. 
aI o(k) 2 


a 
--+ 
T 


When the tabulation of the distribution by the above formula was first 
considered, the then most readily available tables of I,(x) were those 
of A. Lodge [5, p. 309]. The interpolation of these for intervals of 0.1 
and the extrapolation by recurrence to 14, which preliminary calcula- 
tion showed would be necessary for 9 decimal accuracy throughout, 
were not works we were prepared to undertake. Then Dr. J. C. P. 
Miller generously provided us with proof sheets of the Royal Society’s 
forthcoming volume of Bessel functions of integral order, proceeding at 
intervals of 0.1-and going up to Js. 

The connection between the two series expansions should be briefly 
noted. We have 


‘a eo -_ kynt2t 
(3.5) Io(k) + 22 Talk) cos nz = ie cos ne 2, Hn +O! 
(eo = 1, eg = 2, n > 0) 


ry kn r) kn (n]2) i — 2i 
(3.6) > —cos*s = DS — $0.0 SN 
n=0 7 n=0 2* t=0 tl(n = t)! 








Hence, if we take the beta function series and neglect terms beyond that 
in k*, it is equivalent to replacing each Bessel function by its ascending 
power series and truncating these at the terms in k". The double series 
converge absolutely, and so we have incidentally proved that the co- 
efficients of the cosine series are the J,; this was not the method used 
by Frullani, who, writing before the modified Bessel functions had 
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been defined, obtained the differential equation satisfied by the co- 
efficient of cos nz by integrating by parts and thence derived the power 
series. 

The use of punch card equipment to calculate series such as (3.4) 
is now fairly widespread. According to Comrie [2], “The first scien- 
tific application of the Hollerith—in 1928—was to the summation of 
harmonic terms, or, in other words, Fourier synthesis.” Comrie’s 
description of this job [3] is still worth reading; the reader should note 
carefully the difference between English and American machines [2, 
pp. 156-157] and remember that the 80-column card is now nearly 
universal. 

In our calculations, values of 2/2Io(k) were determined by hand cal- 
culator to 10 decimals and punched into cards for k= [0.1 (0.1) 4.2]. 
I, and nm were punched; then 2/,,(k)/rIo(k) and 21,(k)/nIo(k) were 
formed successively on the 602 calculating punch. The angles nz, 
1S<nS14, x=[0° (2.5°) 180°], were reduced by hand to the first 
quadrant and the sines were punched to 9 decimals from the National 
Bureau of Standards table [19]. 

The term a/x, being a multiple of 1/72, was calculated on an I.B.M. 
405 tabulator by progressive summation. The sines were multiplied 
by the Bessel function quotients on the 602. Although the summations 
could have been made on this machine also, they were made on a 
tabulator (both 405 and 402 were used) in order to have a printed 
record of the terms; the process differs from that in [2] principally in 
the fact that the tabulators used in our calculations are equipped to 
subtract from the card. A 513 summary punch was connected to 
punch a 9 decimal table of the function (with errors up to 7 units in 
the last place); this table was printed and examined for gross errors. 
It was then checked by forming first to sixth differences, inclusive, in 
both the k- and a-directions, on the 602. From the 9 decimal second 
and fourth differences, in each direction, modified second differences 
were computed by the usual formula 


bm? = 5% — .183936. 


The function and the modified differences in the two directions were 
then rounded to 7 decimals by adding 5 to the eighth decimal. The 
modification and the rounding were checked by summing the cards in 
groups on the tabulator, and the table was then printed. 

The present abridgment, Table 4, was prepared by taking cards cor- 
responding to increments of 0.2 in k and 5 degrees in a and differencing 
them five times in each direction; the fifth differences were used only to 
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control gross errors in the fourth. The modification, rounding, check- 
ing, and printing then proceeded as with the 7 decimal table, the 5 
being added, of course, in the sixth place. 


4. SUMMARY 


The mean direction ao and the vector strength d of circular data as 
used by climatologists are calculated from (1.2’) and (1.3). The same 
statistics may be used for data other than weather. With data grouped 
in monthly class intervals the calculations of these statistics are facili- 
tated by (1.4) and (1.5). The assumption that the maximum likelihood 
estimate of a “true value” shall be given by the mean direction leads 
to the circular normal distribution (2.1). The central location parame- 
ter of this distribution is estimated by the mean direction a». The 
parameter k is estimated from @ by means of Table 2. The radii for 
plotting the distribution in aequiareal polar scale are obtained by 
multiplying the values given in Table 3 by »/n/12, where » stands 
for the total frequency. Areas of the distribution are given in Table 4. 
Numerical examples will be given in a forthcoming article. 
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BOOK REVIEWS 


Mathematics Essential for Elementary Statistics. Revised Edition. Helen M. 
Walker (Professor of Education, Teachers College, Columbia University). New 
York: Henry Holt and Company, 1951. Pp. xiii, 382. $2.75. 


Leo A. Goopman, University of Chicago 


nis book, a self-teaching manual for the adult who has forgotten or 
poe grasped elementary mathematics, still remains without serious 
competitors. 

The first edition (1934) of this book was reviewed by Herbert Toops in 
this Journal, Volume 30 (1935), pp. 483-84. This revised edition contains 
new sections dealing with simple operations with common and decimal 
fractions, inequalities, linear interpolation, permutations and combinations, 
the binomial expansion, the multinomial expansion, ratios and proportions, 
trigonometric identities, second degree equations, degrees of freedom, and 
weights and measures. Many of the explanations appearing in the 1934 edi- 
tion have been expanded. In order to keep the book small (the first edition 
contained 246 pages) all those sections of the first version which related 
specifically to the content of statistical method have been eliminated. 

Every topic discussed is accompanied by self-scoring exercises, answers 
for which are provided in a key at the back of the book. Some of tinese answers 
(e.g., p. 9 problem 14, p. 34 problem 11 (2), p. 46 problems 4 and 5, and 
others) are incorrect, but this should not confuse the student who has 
mastered the topic. 


Elements of Statistics. C. G. Lambe. London: Longmans, Green and Company, 
1952. Pp. viii, 112. 


Cart F. Kossack, Purdue University 


1 po author sets forth the object of this text as follows: ‘‘This work is 
written primarily for the use of engineering students, but it is hoped that 
it will be a useful introduction to the Theory of Statistics for scientists 
and experimentalists of all kinds.’’ Such a goal is a very ambitious one and 
one which has interested the reviewer for some time. It embraces the difficult 
question of whether it is necessary to have separate introductory courses in 
statistics for each major area or if a single first course will suffice. Since this 
text also includes a section on Quality Control, one finds himself faced with 
the further problem of whether this phase of statistics can be combined with 
other phases or needs to be treated in an independent course. Unfortunately, 
the text being reviewed does not help much if one is interested in answering 
such questions. 

In fact, the text is so brief, consisting of only 104 small pages, that one 
feels that many of its weaknesses can be attributed to this limitation in 
space. The material covered falls into three major catagories: elementary 
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statistical methods, the theory of errors, and quality control. The approach 
lacks inspiration. It follows the usual outline: frequency distributions, 
measures of dispersion, continuous distributions, etc., each chapter being 
very restricted in its nature, consisting of around ten pages including the 
exercises. The large number of exercises associated with each chapter is to 
be commended, but one cannot help but feel that they have been introduced 
either as arithmetic projects or for mathematical gymnastics rather than to 
develop the student’s appreciation of statistics. 

The chapter on the theory of errors is the most worthwhile one in the book. 
The material in this chapter deserves being considered for inclusion in more 
courses in statistics than it is today. It is apparent that the eight pages used 
by the author to discuss quality control are not enough to do the subject 
justice. 

One could mention particular difficulties encountered in reading certain 
sections. Even the first sentence in the book gives rise to some question, 
“Statistics may be defined as the scientific treatment of numerical data.” It is 
apparent that the statistician will soon inherit the earth. Not enough atten- 
tion is given to explaining the real purpose behind the many statistical ideas 
and measures that are introduced. For example, the concept of independence 
of two variables is never really explained, let alone defined, although it is 
used. 

An improvement in the text could be achieved if more space and attention 
were given to the sections on the theory of errors and quality control, with 
the other material restricted to an introductory section stressing more the 
basic concepts of statistics, especially those needed to appreciate the work 
covered in these two areas. 


Introducci6n a los métodos de la estadistica. 1* Parte. Sizto Rios. Madrid: 1952. 
Pp. 205. Paper. 


Paut R. Haumos, University of Chicago 


His is a charming and elementary book that fulfills, within the limits the 

author sets for it, the promise of the title. The mathematical level of the 
book is not advanced; nothing more high-powered than integral calculus is 
ever used. The topics discussed are: statistical tables, graphical representa- 
tion, frequency and probability, sampling, random variables, multidimen- 
sional statistics, correlation, regression, sampling distributions, and tests of 
hypotheses. Although at times the definitions are a little vague (cf. the defini- 
tions of statistical variable and random variable on pp. 35 and 36), the large 
number of examples, complete with detailed tables and graphs, is likely to 
give the reader a sound intuitive grasp of the subject. A notable feature of 
the book (emphasized by Herman Wold in his preface) is the number, 
variety, and interest of the exercises. The reader’s statistical imagination 
will probably be stimulated when he is asked whether meat consumption in 
Madrid can be adequately studied by using samples of telephone subscribers 
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(p. 8), and when he is requested to perform various intricate coin-tossing 
experiments that illustrate the possible presence of correlation in multi- 
dimensional situations (p. 97). As a pre-technical introduction, with the 
purpose of telling the experimenter what ideas he is likely to encounter in 
his study of statistics and what statistics is likely to do for him, the book is 
highly recommended. 


Statistik Tidskrift. Nr 1, 1952. Stockholm: Statistiska Centralbyrin, Januari 
1952. Pp. 83. Paper. 

TATISTIK Tidskrift published by the Central Bureau of Statistics of Sweden 
S is a new journal whose main purpose is to give summaries of current sta- 
tistics at an earlier date than official statistical reports usually do. It will 
also contain articles on new statistical techniques, international statistical 
collaboration, reviews of books in statistics, and so on. 

Among the articles in this first issue the following two are of special inter- 
est to statisticians, in general: one by H. Cramér on statistical methodology 


and the other by T. Dalenius on operations analysis. 
U. G. 


A Guide to Tables of the Normal Probability Integral, National Bureau of Stand- 
ards Applied Mathematics Series 21, Washington, U.S. Government Printing 
Office, 1952, 15 cents. Pp. iv, 16. 


Tz. Guide” its Introduction says, “makes available in one place certain 


information that will be helpful in utilizing the normal probability tables 
given in standard statistical texts and other important sources. ... The aim 
has not been to present an exhaustive or historical compilation but rather, to 
include the more important sources likely to be accessible in libraries of 
American universities, research organizations, etc., as well as some of the 
more recent publications that educational institutions, research organiza- 
tions, and students of statistics might be encouraged to acquire.” There are 
62 entries in the bibliography. 

The tables are classified into two main categories, direct and inverse, 
according to whether the probability is shown as a function of the unit 
normal deviate, or the deviate as a function of the probability. Each category 
is divided into five groups, according to the kind of probability tabulated: 
central, semi-central, two-tail, single-tail, or cumulative. ‘Parts I and IT tell 
which sources give tables for the indicated types of segments of area under 
the normal curve and give the essential characteristics of the tables. These 
features are succeeded by a description of interpolation methods (Part III) 
especially applicable to certain of the tables mentioned. Then follow two 
brief comments [i.e., two long sentences]... giving the relationship be- 
tween some types of normal tables” and the incomplete gamma and chi- 


square tables. 
W. A. W. 
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Consumers’ Price Index. Report of a Special Subcommittee of the Committee 
on Education and Labor Pursuant to H. Res. 73, Subcommittee Report No. 2, 
House of Representatives, 82nd Congress, 1st Session, 1951, 39 pp. 


JuLes BackMAN, New York University 


ARLY in 1951, the United Electrical, Radio, and Machine Workers of 

America published a study containing extensive criticisms of the Con- 
sumers’ Price Index. This study followed the pattern of the Meany-Thomas 
Report which was used to denounce the Bureau of Labor Statistics and the 
Consumers’ Price Index during World War II. To evaluate these charges 
and to determine the present adequacy of the Consumers’ Price Index, a 
special Subcommittee of the House Committee on Education and Labor 
was established. After hearing a number of witnesses, this Committee pre- 
pared the report under review. 

This report is an excellent brief survey of how the Consumers’ Price Index 
is constructed, its use, and its history. The Committee found that the UE 
charges were without merit and that the Consumers’ Price Index “is an 
excellent index.” The Committee made several recommendations which ap- 
pear to be solidly based. Thus, for example, they recommended that the 
BLS make periodic studies of “changes in living costs due to changes in 
non-price factors” and “the extent of changes in prices paid by wage earners 
and lower salaried clerical workers who moved from one community to 
another.” Such studies would be useful particularly during periods char- 
acterized by major upheavals in the economy such as those which attend 
an armament program. The Committee properly suggested that the Con- 
sumers’ Price Index should not incorporate these types of changes. 

The Committee also reviewed the suggestion that income taxes should be 
included in the index. It recommended a continuation of the present practice 
under which the BLS includes excise and sales taxes, but excludes income 
taxes. All citizens have a responsibility to pay their fair share of the cost 
of government. If income taxes were included in the Consumers’ Price 
Index, the net result would be to exempt from such responsibility all workers 
who are covered by escalator clauses or who received wage increases based 
upon changes in the Consumers’ Price Index. The adoption of a general 
sales tax, however, would create serious problems in this connection and 
would require a reexamination of the BLS method of handling such taxes. 

The Committee also recommended that BLS should follow “a policy of 
keeping the index continuously under review, with revisions as required 
when important changes in buying habits occur, rather than infrequent 
complete revisions as has been the practice in the past.” If such a continuous 
review were made, there would be little need for the major revisions of the 
index such as the Bureau is now engaged in making. The Committee properly 
suggests that BLS “should be extremely cautious against making too fre- 
quent changes of the index which are minor in character.” 

The Committee has recognized that to effectuate the changes it suggests 
would require adequate financial support for the Bureau. This in itself is 4 
contribution since very often such committees make recommendations 
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without being willing to face the fact that their execution would cost money. 
It is very healthy to have periodic reviews of an index as important as the 
CPI. The Steed Committee is to be congratulated for a very workmanlike 


job. 


A Dollar Index of Soviet Machinery Output, 1927-28 to 1937. Alexander Ger- 
schenkron, assisted by Alezander Erlich. Santa Monica: The Rand Corporation, 
April 6, 1951. Pp. v, 352. Paper. Not published for sale; 2 limited number of 
copies are available on request from interested scholars. 


JoHn CRAWFORD, United Nations 


, ear study is mainly devoted to the construction and appraisal of a new 
index based on the physical quantities of 128 items of machinery pro- 
duced in Russia between 1927-28 and 1937 weighted with 1939 U. S. factory 
prices. This unusual index is described by the author as “part of a compre- 
hensive recomputation of the rate of growth of industrial output in Russia 
... justified by the manifold deficiencies of the official Soviet index of 
industrial production.” 

Briefly, it is charged that the official Soviet indexes have grossly over- 
stated the real rate of growth of industrial output. The machinery index is 
regarded as exemplifying this tendency since it is felt to reflect in exaggerated 
form the main weakness of the Soviet indexes, namely, the introduction of 
new commodities at approximately their current prices during the period of 
general inflation subsequent to 1926-27, the weight year of the indexes. 
This results in an upward bias because the output of the new items expanded 
faster than the old. Thus an examination of the index computed by Gerschen- 
kron reveals that 24 items were first included in 1931 and 21 in 1932. The 
value of these items in 1937 (at 1939 U. S. prices) was 3.5 to 4 times their 
value in the year they were first included, whereas the total index only 
doubled during the same period. 

As evidence of the charge that the “1926-27 prices” of machinery are 
inflated (presumably in the sense that they contain prices of later years for 
commodities not produced in 1926-27), the author refers to data presented 
in the Soviet Plan for 1941. This source contains values of planned output 
for 1941 at current and at 1926-27 prices for a number of different Ministries. 
These indicate that the 1941 planned output at current prices for the elec- 
trical and other machinery industries exceeds that in 1926-27 prices by only 
10 per cent, whereas 1941 values for the ferrous metal, coal, and woodwork- 
ing industries are more than double the values in 1926-27 prices. It is con- 
cluded that the similarity of values at 1941 and 1926-27 prices for the ma- 
chinery industries (in which the increase in physical production is known to 
have been large) is explained by the fact that so-called 1926-27 prices con- 
tain prices of subsequent years for commodities not produced in 1926-27. 
The author rejects as a sufficient explanation cost reductions in the machin- 
ery industry (said to have amounted to 45 percent over the period of the 
Second Five-year Plan, 1933-1937) on the ground that this reduction was 
insufficient as well as on the basis of statements of Soviet writers. 
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In addition to the effect of the general price inflation, the weights are 
believed to be inflated also because the prices used for the new commodities 
are said to be those which prevailed in the initial period of mass production 
before the usual increases in productivity took place. 

Aside from the problem of new commodities, the use of 1926-27 nrices 
as weights is considered to have imparted an upward bias to the index be- 
cause increases in output and productivity are positively correlated. Thus 
the use of beginning-period prices as weights would result in an index showing 
a larger rate of growth than the use of end-of-period prices. This proposition 
is tested with U. S. price and quantity data for machinery items common 
to the years 1899 and 1909, 1899 and 1923, 1909 and 1923, and 1909 and 
1939: indexes computed with beginning-period prices as weights show from 
two to seven times as large an increase as when end-of-period prices are 
used. The longer the period of comparison the greater is the relative increase. 

To get around the problem caused by the “process of changing scarcity 
relationships” involved in the economic development of a country, and 
lacking a consistent set of Soviet prices for machinery, the author used 1939 
U.S. price relationships. His choice of U. S. prices is defended on the grounds 
of their availability and the fact that their use permits a comparison of the 
absolute levels of U. S. and Soviet machinery production. The prices used 
were drawn from Census of Manufactures data and from American business- 
men, who having dealt with Russia were able to identify and price American 
counterparts of Soviet machinery items. Use of the Census limited the period 
chosen to census years and from these 1939 was selected as being the most 
recent available for the purpose, data for 1947 being regarded as distorted 
by the war. According to the author, “The problem of using as index weights 
prices of a different country is essentially the same as that of basing an index 
of a country’s output over a given period of industrialization alternatively 
on prices pertaining to the beginning and to the end of a period.” It would 
seem, then, that if the Russian machinery industry during the 1930’s is 
regarded as having been at the same stage of economic development as, for 
instance, that in the U. S. just prior to the first Worid War, the use of 1914 
U. S. prices rather than 1939 would have more nearly approximated Soviet 
scarcity relationships. 

The fact that in 1937 the Soviet index exceeded 3.2 times the index com- 
puted by Gerschenkron (1927-28=100) is explained by (1) an unknown 
amount of downward bias in the latter measure; (2) pricing errors of the 
Soviet index; and most important of all, (3) the effects of using as weights 
for the computed index prices of a year (1939) that approximate the end of 
the period (1937) and are prices of a more highly developed economy. As the 
author states, “The use of American prices of the year 1939 is an altogether 
sut generis way of looking at Soviet Output. The index of Computed Output, 
therefore, is not a gauge by which the Soviet Indexes can be adjusted.” 

Accordingly, the charge of bias in the Russian index still seems to rest 
on the validity of these propositions: that there was a substantial degree of 
inflation in Soviet machinery prices after 1927-28, and that in a period of 
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economic development the use of base period weights leads to a higher index 
than weights based on end-of-period prices. 

Whether the reader will agree with the author’s conclusion that “with all 
its deficiencies thé [Gerschenkron] index should still provide a possible basis 
for gauging the development of a crucial branch of Russia’s industrial 
economy” will depend on the meaning attached to measures of industrial 
production. Aside from the pricing deficiencies of the Soviet index the ques- 
tion of the choice of beginning- or end-of-period weights must rest on the 
use of the indexes. As to the use of U. S. prices as weights, it would appear 
that the meaning of a production measure is anchored in its economic 
setting so that whatever phenomenon the new index can be said to measure, 
it is not the physical volume of Russian machinery output. 

Students of production indexes will be indebted to Professor Gerschenkron 
not only for the general clarity of his description but also for the 288 pages 
of appendices which set forth the basic data and computations underlying 
the study. May the Russians take heed. 


Business Fluctuations. Robert A. Gordon. New York: Harper and Brothers, 
1952. Pp. xvi, 624. $5.00. 


Henry H. Vitiarp, The City College, New York 


ROFESSOR Gordon has written an able and unusual volume on business 

fluctuations, placing main emphasis on actual cycles and minimizing the 
space devoted to cycle theory. The material is divided into three parts: 
the first, covering background understanding and tools of analysis; the sec- 
ond, the nature and causes of fluctuations; and the third, prediction and 
control. 

Part I starts with a discussion of the income and transaction versions of 
the quantity equation, and of national income, gross national product, and 
related concepts. Next the determinants of aggregate demand are considered. 
The relationship between consumption and investment is examined at 
length, both in theoretical, ex ante terms and in the light of statistical, ex post 
evidence, and the theory of the multiplier is analyzed. The first part ends 
with a discussion of fluctuations in investment and the role of the accelera- 
tion principle. The conclusion is that the acceleration principle—either as 
an explanation of investment fluctuations or in conjunction with the multi- 
plier as an explanation of entire cycles—is a weak reed without noticeable 
empirical support, and that the Keynesian determinants via liquidity pref- 
erence and the supply of money are of no more practical use. The detailed 
subsequent discussion is therefore necessary if fluctuations are to be under- 
stood. 

The second part, on the nature and causes of fluctuations, constitutes the 
core of the volume. It starts with a presentation of the meaning and available 
measures of business activity, including aggregative measures such as the 
GNP, bank debits, and employment and index numbers such as those of 
the Federal Reserve and the American Telephone Company. Next the char- 
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acteristics of private-enterprise economies making for secular and cyclical 
change are set forth, with population growth, technological developments, 
alterations in tastes, “structural changes,” and wars placed in the former 
category and production for profit, price-cost relationships, and the use of 
unstable money in the latter. Stress is laid on the inevitable tendency 
toward fluctuations under these conditions, which the much emphasized 
relationship between saving and investment summarizes but does not ex- 
plain. The kinds of economic change which result are set forth, including 
seasonal, secular, cyclical, and irregular movements. The nature and past 
behavior of secular movements is covered, together with Kondratieff long 
waves (which are considered to be unconfirmed) and building cycles (which 
are considered significant in connection with business cycles of unusual 
severity). This clears the way for a presentation of how the economy behaves 
during business cycles, which relies heavily upon the studies of the National 
Bureau of Economic Research (whose method of measuring the cycle is 
described at this point), and leads in turn to a discussion of how the economy 
generates business cycles. The most original part of the presentation is a 
welcome distinction between minor and major depressions: the former are 
likely to result from over-expansion of inventories or overproduction in 
particular areas and be ended by liquidation of inventories or absorption of 
the specific excess capacity; the latter are likely to result from impairment 
of long-term investment opportunities and be ended by accumulated needs 
to replace worn-out capital and renewed expansion of growth industries, 
with building activity sometimes playing an important initiating role. 

Two of the last four chapters of Part II are devoted to business cycle 
theories and two to a description of American economic developments from 
1919 to the Korean War. The theorists discussed are grouped as “business- 
economy” theorists (Mitchell, Veblen, Pigou, and Metzler), monetary 
theorists (Hawtrey, Wicksell, and Warburton), a “capital-shortage” group 
(Hayek, Spiethoff, and Cassel), a “partial-overinvestment” group empha- 
sizing the exhaustion of specific investment opportunities (Schumpeter, 
Hansen, and Robertson), and a group stressing overinvestment in rela- 
tion to final demand (Clark, Harrod, Kalecki, and Hicks). Underconsump- 
tionists and weather theorists sre mentioned briefly, while an appendix 
is devoted to econometric model-builders. The final two chapters describing 
actual fluctuations in detail are in many ways the most original and inter- 
esting in the book. That they are urgently needed is obvious, now that so 
many students have no recollection of anything except the prosperity that 
has prevailed since World War II. The presentation contains many sugges- 
tions as to the factors responsible for the fluctuations that occurred without 
sacrificing balance in the description of the events themselves. 

The final section of the volume is devoted to prediction and control. Two 
broad methods of forecasting business activity are distinguished: historical 
patterns and “cross-section analysis.’”’ In the first group only the statistical 
indicators of the National Bureau are found to be promising, but various 
methods of projecting the GNP and its components (and relating them to 
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an industry and ultimately to a firm) are found to be worthwhile. Inter- 
national aspects of instability are also discussed in this section, including 
the way in which fluctuations are transmitted and the measures needed to 
reconcile internal and external stability. But the bulk of the section is 
devoted to the achievement of domestic stability, including not only full 
employment but also the elimination of significant price changes. The extent 
to which these objectives can be realized by controlling consumption and 
private investment, by using monetary and fiscal policies, and by influencing 
wages and prices are discussed at length. A final chapter summarizes the 
extent to which governments in the United States and elsewhere are today 
formally committed to achieve full employment and the actual preparations 
that have been made. In attaining stability major importance is attached 
to fiscal policy, but the broad setting in which such policy must operate and 
the way in which other policies can be used to help is also made clear. 
Throughout, the emphasis on how to achieve stability is unusual and will 
be welcome to those whose students insist on knowing what can be done 
about economic fluctuations. 

Professor Gordon’s volume is a solid and substantial achievement. If I 
have any reservation, it is that the book is a bit too solid for its original 
objective of providing a text for an upper-division course required of all 
business administration majors at the University of California. Almost all 
texts dealing with business cycle theories have felt an obligation to include 
theories which have been discredited professionally; Professor Gordon seems 
to have felt a similar obligation to include almost everything that business 
cycle economists have been doing in recent years, whether or not the results 
have been particularly fruitful. The result is a book with quite unequalled 
references to the periodical literature which should be of unique value to the 
graduate students just starting in or to professional economists who have 
not kept up with recent developments. But I believe—though it is clearly a 
matter of judgment —that the volume would have been of greater value to 
undergraduates if perhaps a third or so of the material had been omitted 
and more emphasis placed on crucial areas of analysis and policy. 

To illustrate: most of the tools considered in the first section are never 
subsequently used to justify the space (and student energy) devoted to their 
development. Again in Part II I believe that the 140 pages leading up to 
how cycles are generated might well have been condensed in the interest of 
an elaboration of the forces involved, which are presented rather sketchily. 
I feel, for example, that too little stress is laid on the increasing postpona- 
bility (with little sacrifice of current satisfaction or production) of expendi- 
tures on consumer and producer durables, investment, and repiacement at 
the end of a period of prosperity, and on the increasing inability to postpone 
expenditures (without immediate repercussions on satisfaction or produc- 
tion) after an extended depression. The discussion of fiscal versus monetary 
policy in Part III also seems to me to reflect an unawareness—admittedly 
one widely prevalent at the present time—of the fact that fiscal policy is 
generally subject to the same limitations as monetary policy in checking 
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an inflation when a commitment to stabilize government bonds at any 
narticular level has been accepted—except perhaps to the extent that a 
budget surplus increases saving and so permits a lower level of interest rates 
to be maintained without inflation. Certainly, holding of the proceeds of a 
surplus in idle Treasury deposits, which Gordon recognizes is usually neces- 
sary if the surplus is to be fully deflationary during a period of prosperity, 
will in most cases have the same repercussions on the government bond 
market as a similar reduction of deposits in the hands of the public through 
the use of monetary policy. Finally, I wish that Gordon had discussed price 
stability somewhat more realistically. Despite a preference for constant 
prices in the long run (p. 487) and a suggestion that stabilization may re- 
quire “some sort of supervision” over wage and price policies (p. 498), the 
subsequent detailed discussion (pp. 551-558) offers little except education 
of ‘abor leaders and the possibility of letting unemployment rise as high as 
6 or 7 per cent. Actually constant prices mean that average wages, including 
fringe benefits, cannot increase as much as a nickel an hour a year.' Given 
such facts as the steady increase in wages from 1933 to 1937, when unemploy- 
ment was far higher than 7 per cent, and Mr. Lewis’s recent characteri- 
zation of an effort to cut his miners back to a wage increase of rather more 
than 10 per cent as exhibiting a sadistic “penchant for robbing miners’ 
babies of life-giving milk,” I feel that more discussion of just how labor 
leaders are to be educated would be of educational value to all of us! 

But these are largely matters of preference and emphasis. The sort of 
book I would like for undergraduates doesn’t exist, and, the proof of the 
pudding being in the eating, I shall be using Gordon’s volume in my under- 
graduate business cycles course next semester. 


Private and Public Investment in Canada, 1926-1951. O. J. Firestone. Ottawa: 
Department of Trade and Commerce, 1951. 4°. Pp. 254. Paper. 


MituarpD Hastay, National Bureau of Economic Research 


HERE should be something of value in this complex and formidable 
monograph for every serious student of investment. 

In origin, it is a source book of Canadian fixed capital expenditures in the 
period 1926-51; and this aspect of the work finds expression in Part II, 
which consists of 64 quarto-size pages of tabular material on Canadian in- 
vestment flows, and Part III, which comprises 38 pages of explanatory 
material on concepts and definitions, sources and estimating techniques, and 
the quality of the estimates. The impressive thing about this side of the 
work is its scale. The estimates cover all outlays on durable productive assets 
by every sector of the economy. This means not only gross investment, but 
also repairs and maintenance, and not only by the business community, but 
also by such institutions as churches, schools, universities and hospitals, by 
governments both on durable physical assets and on development and con- 





1 Assuming a 2 per cent annual increase in productivity per man hour and no redistribution of in- 
come or increase in unemployment. As dividends are about 5 per cent of wages and salaries, oppor- 
tunities for redistribution are severely limited. 
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servation of natural resources, and by individuals on housing. The figures 
are shown separately for the private and public sectors, and also for in- 
dustrial groupings based on the Canadian Standard Industrial Classification, 
defined to include publicly as well as privately owned establishments. 
Finally, both gross investment and repairs and maintenance are divided 
into outlays for construction and outlays for machinery and equipment. 
Most of these distinctions can, in turn, be crossed; and several of them, such 
as “the public sector” or industrial groupings such as “manufacturing” and 
“utilities,” are themselves multiple classifications. The end product is a set 
of 123 tables—each containing, on the average, four non-duplicating time 
series—which presents a picture of Canadian investment that cannot be 
matched either in comprehensiveness or in detail by published estimates 
covering the United States. These estimates are of course of varying quality, 
and every user must assure himself that their defects are not critical for his 
purposes. But they have been put together on a systematic plan and appear 
from the descriptions to be at least as reliable as the best comparable esti- 
mates compiled for the United States. 

More than half the book is taken up by Part I, which constitutes a 133- 
page interpretative summary of the stark facts contained in the statistical 
sections. This summary is comprehensive rather than intensive, but it adds 
up to a highly instructive account of the growth and changing structure of 
the Canadian economy as disclosed in a quarter-century of investment ex- 
perience. Every reader of this summary will be impressed by the author’s 
wide knowledge of Canadian economic development and by his sure sense 
of the special factors which serve to diversify investment patterns in different 
regions and industries. And if the reader is familiar with United States 
experience, his attention will be arrested by a number of striking parallels. 
For the diversity of Canadian resources is scarcely less than that of our 
own, and the Canadian people are embarked on a course of industrialization 
which recapitulates much of our economic history at a substantially stepped 
up pace. Thus, in the last quarter century, manufacturing has gone ahead 
of agriculture in the number of persons employed and now contributes 
roughly three times as much to national income. Utilities, too, have under- 
gone a phenomenal expansion until, in relation to population, Canada’s 
railroad, telephone, and electric power networks rank among the largest and 
most advanced in the world. And, finally, Canada has pioneered in working 
out viable relationships between free private enterprise and a federal 
system of government. In all of these respects Canadian experience supple- 
ments our own, and Dr. Firestone’s monograph should find a grateful audi- 
ence among students of investment in this country. 


Psychological Analysis of Economic Behavior. George Katona. New York: 
McGraw-Hill Book Company, 1951. Pp. ix, 347. $5.00. 
Maraaret G. Rew, University of Chicago 


—_ author has set for himself the task of describing a psychological 
approach to economic analysis and the current research in the field of 
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economic behavior,” to be done in such a manner as to reach “the genera] 
public as well as the expert.” (p. v.) To the reviewer the book has more of 
the earmarks of a text for undergraduate students than a monograph for 
experts, or of a book for lay readers, however interested they may be “in 
present-day American economic life.” Furthermore it appears to deal with 
specialized research rather than what can be described as “the current 
research.” 

The book is divided into five parts: Part I. Problems and Tools; Part II. 
Consumer Behavior; Part III. Business Behavior; Part IV. Economic 
Fluctuations; and Part V. Research Methods. Part II contains more than 
one third of the pages, and almost all the statistical data presented about 
economic behavior. Parts III and IV deal with concepts, theories and 
generalizations based on non-statistical observations, despite the author’s 
judgment that “statements not susceptible of empirical validation have no 
place in psychology.” (p. 29.) Unsubstantiated and in fact highly dubious 
statements occur; for example, “Business and labor leaders have become 
increasingly aware of their responsibility to the public as well as of the fact 
that their own interests can best be promoted by striving for what is in the 
general interest.” (p. 294.) One chapter in Part I is devoted to “What kind 
of psychology?” In this the student is presumably to be introduced to the 
“solidly established tenets of modern psychology.” (p. 8.) However, the 
psychological ideas actually used in later discussion are so much a matter of 
common knowledge that the person who omits this chapter is not likely to 
feel handicapped. 

Part V dealing with Research Methods comprises 34 pages. Attention 
centers on the investigation of attitudes rather than of behavior as such and 
types of schedules and questions and their use in getting reports on attitudes. 
The text ends with two pages devoted to “research projects” in which the 
author emphasizes the need for examining in so far as possible the effect of 
one factor at a time. He expresses the belief that considerable progress has 
been made in this direction. He is, however, very vague or silent as to the 
extent to which a systematic field of knowledge has developed in the psy- 
chology of understanding economic behavior or the concrete gaps that exist 
and should be a challenge to research workers. 

With minor exceptions data presented in Part II are those provided by 
the Consumer Finance Surveys conducted by the Survey Research Center 
of the University of Michigan for the Federal Reserve Board. Out of 34 for- 
mal tables 32 present data from this source, one describes the components 
of the national budget in selected years and one presents hypothetical data. 
These tables furnish a useful digest of the findings of the Consumer 
Finance Surveys and include some data not previously published. 

The author states at the outset that “it will be shown in the book that 
studying the motives, attitudes, and expectations of consumers and business- 
men contributes to the understanding of spending, saving, and investing. 
... The results of psychological-economic studies will supplement the 
traditional analysis of supply, demand, income and consumption.” (p. 4.) 





— 


ae ma _- Shiites ae & ff 2am @ «FD 


BOOK REVIEWS 165 


The expectations raised by considerable emphasis at the outset on “motives, 
attitudes and expectations” are likely to be disappointed for many readers. 
Out of 32 tables presenting findings from the Consumer Finance Surveys 
only 11 include data on attitudes, motives and expectations. The author 
does not, however, devote much space to considering the unique contribu- 
tion of the two types of data. He does emphasize that people are able to 
predict change in future income but little or no recognition is given to the 
fact that the incomes of individuals and of families tend to move through 
well-defined cycles so that individuals tend to be able to predict the course 
of their income even when unable to predict the course of national income. 
The author does state “Perhaps primarily, the analyst can rely on the 
results of cross tabulations between forms of behavior and certain charac- 
teristics of people.” (p. 75.) The context would seem to imply that the 
author had in mind things other than “motives, attitudes and expectations,” 
although contact with certain sources of information is included in charac- 
teristics. The data used have a uniqueness with respect to characteristics 
of people that for many will be the major attraction of this book. Whereas 
the earlier analyses of consumption and current savings in relation to eco- 
nomic levels were largely confined to the effect of income of one year the 
analysis presented here in addition includes income of two years, extent of 
change in income and assets and especially in liquid assets. The author is, 
however, handicapped in his analysis of numerous variables by the necessity 
of using a small national sample. 

Because of the excessive attention to variations in behavior factors 
among individuals readers may feel that they lose sight of a major fact of 
interest—namely the mass response to certain types of change. Some of the 
problems of aggregation are touched on but only in a rather tentative fashion. 

The book is marred by occasional carelessly worded or superfluous state- 
ments. Some confusion, for example, exists in the meaning of “fact” and 
“knowledge” and “response” of interviewees. It would hardly seem nec- 
essary to make an extensive investigation to disclose that “people’s mo- 
tives, or needs, desires and hopes... were conflicting.” (p. 73). Nor does 
it seem necessary to hypothecate after the accumulated observation of 
human history, not to mention the development of philosophy, that “some 
of our expenditure may be found to be nothing but expressions of whims or 
the results of emotions.” (p. 69.) In view of the classification of money spent 
on durable goods as expenditures the reader is likely to be a little puzzled 
at the statement that “the net sum of those relatively infrequent outlays 
which are not used for the satisfaction of immediate needs is considered 
saving.” (p. 151.) We are also told that “There were people who felt that they 
were seriously affected by the ‘depression’—mostly people with low incomes.” 
(p. 281.) One is left to wonder whether they were people with low incomes 
because of the depression. After a general discussion of “probable reactions 
to policy” without any empirical evidence the author concludes that “it is 
often possible to differentiate between appropriate and inappropriate re- 
sponses to economic policy and to economic change.” (p. 288.) 
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